Update en_GB speller
Hello! Could someone update the GB speller to my version 2.40: https://github.com/marcoagpinto/aoo-mozilla-en-dict ? Thanks! Kind regards, >Marco A.G.Pinto -- -- ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Fwd: Yakov! (rule still giving error)
Thanks, Yakov and Jan! Kind regards, >Marco A.G.Pinto --- On 17/08/2016 20:37, Yakov Reztsov wrote: > Another variant of rule: > > > > > > há > regexp="yes">segundos?|minutos?|horas?|dias?|semanas?|mês|meses|anos? > atrás > > Com o verbo haver não é necessário usar "atrás": > no="2"/>. > Há n segundos > atrás. > > > > Среда, 17 августа 2016, 21:49 +03:00 от "Marco A.G.Pinto": > > Hello! > > I sent a private message to Yakov but maybe he didn't receive it, > so I am sending it to the mailing list: > > The rule still gives an error: > > > > > há > > > regexp="yes">segundos?|minutos?|horas?|dias?|semanas?|mês|meses|anos? > atrás > > Com o verbo haver não é necessário usar "atrás": > > . > Há n segundos > atrás. > > > TESTRULES PT says: > > > What shall I do? > > Thanks! > > Kind regards from your friend, > >Marco A.G.Pinto >--- > > -- > > > -- > > Yakov Reztsov -- -- ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Fwd: Yakov! (rule still giving error)
Hello! I sent a private message to Yakov but maybe he didn't receive it, so I am sending it to the mailing list: The rule still gives an error: há segundos?|minutos?|horas?|dias?|semanas?|mês|meses|anos? atrás Com o verbo haver não é necessário usar "atrás": . Há n segundos atrás. TESTRULES PT says: What shall I do? Thanks! Kind regards from your friend, >Marco A.G.Pinto --- -- -- ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Help improve rule pt_PT
Thanks, Jan, I use "1" and "3" but TESTRULES PT gives an error in: Com o verbo haver não é necessário usar "atrás": \1 \2 \3. Can you help? Thanks! On 17/08/2016 12:36, Jan Schreiber wrote: > Marco, > > I think you could replace > > from your original version with the following: > > (Replace 0 with 1 if necessary.) > > Best, > Jan > > Am 17.08.2016 um 13:25 schrieb Marco A.G.Pinto: >> Yakov, >> >> It didn't work. >> >> I tried it in the stand-alone tool after using *TESTRULES PT*, with the >> sentences: >> *Isso aconteceu à quase 20 anos!** >> **Isso aconteceu à quase vinte e três anos!** >> **Há 10 anos atrás** >> **Há quarenta e dois anos atrás* >> >> Rules I changed: >> *** >> **** >> ** ** >> **há** >> **** >> **> regexp="yes">segundos?|minutos?|horas?|dias?|semanas?|mês|meses|anos?** >> **atrás** >> ** ** >> ** Com o verbo haver não é necessário usar "atrás": >> \1 \2 \3.** >> ** Há n segundos >> atrás.** >> **** >> ** >> ** >> **** >> **** >> **** >> ** ** >> **** >> **à** >> ** ** >> **quase** >> **** >> **> regexp="yes">segundos?|minutos?|horas?|dias?|semanas?|mês|meses|anos?** >> ** ** >> ** Substituir «à» por há.** >> ** Conheço a Ana à quase >> 30 anos.** >> ** * >> >> >> Thanks! >> >> Kind regards, >>>Marco A.G.Pinto >> --- >> >> >> On 17/08/2016 11:35, Yakov Reztsov wrote: >>> Hello! >>> One of the options is add *skip="1":* >>> >>> >>> >>> >>> há >>> >>> >> regexp="yes">segundos?|minutos?|horas?|dias?|semanas?|mês|meses|anos? >>> atrás >>> >>> Com o verbo haver não é necessário usar "atrás": >>> \1 \2 \3. >>> Há n segundos >>> atrás. >>> >>> >>> Среда, 17 августа 2016, 10:59 +03:00 от "Marco A.G.Pinto" <>: >>> >>> Hello! >>> >>> I want to improve the following rule: >>> >>> *** >>> **** >>> ** ** >>> **há** >>> **** >>> **>> >>> regexp="yes">segundos?|minutos?|horas?|dias?|semanas?|mês|meses|anos?** >>> **atrás** >>> ** ** >>> ** Com o verbo haver não é necessário usar "atrás": >>> \1 \2 \3.** >>> ** Há n segundos >>> atrás.** >>> *** >>> >>> >>> I want the token after "há" to accept up to three words. >>> >>> For example: >>> "há *vinte e três* dias atrás" >>> >>> What changes shall I make? >>> >>> Thanks! >>> >>> Kind regards, >>> >Marco A.G.Pinto >>>--- >>> >>> -- >>> >>> >>> -- >>> >>> Yakov Reztsov >> -- >> >> -- -- ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Help improve rule pt_PT
Yakov, It didn't work. I tried it in the stand-alone tool after using *TESTRULES PT*, with the sentences: *Isso aconteceu à quase 20 anos!** **Isso aconteceu à quase vinte e três anos!** **Há 10 anos atrás** **Há quarenta e dois anos atrás* Rules I changed: *** **** ** ** **há** **** **segundos?|minutos?|horas?|dias?|semanas?|mês|meses|anos?** **atrás** ** ** ** Com o verbo haver não é necessário usar "atrás": \1 \2 \3.** ** Há n segundos atrás.** **** ** ** **** **** **** ** ** **** **à** **** **quase** **** **segundos?|minutos?|horas?|dias?|semanas?|mês|meses|anos?** ** ** ** Substituir «à» por há.** ** Conheço a Ana à quase 30 anos.** ** * Thanks! Kind regards, >Marco A.G.Pinto --- On 17/08/2016 11:35, Yakov Reztsov wrote: > Hello! > One of the options is add *skip="1":* > > > > > há > > regexp="yes">segundos?|minutos?|horas?|dias?|semanas?|mês|meses|anos? > atrás > > Com o verbo haver não é necessário usar "atrás": > \1 \2 \3. > Há n segundos > atrás. > > > Среда, 17 августа 2016, 10:59 +03:00 от "Marco A.G.Pinto" <>: > > Hello! > > I want to improve the following rule: > > *** > **** > ** ** > **há** > **** > ** > regexp="yes">segundos?|minutos?|horas?|dias?|semanas?|mês|meses|anos?** > **atrás** > ** ** > ** Com o verbo haver não é necessário usar "atrás": > \1 \2 \3.** > ** Há n segundos > atrás.** > *** > > > I want the token after "há" to accept up to three words. > > For example: > "há *vinte e três* dias atrás" > > What changes shall I make? > > Thanks! > > Kind regards, > >Marco A.G.Pinto >--- > > -- > > > -- > > Yakov Reztsov -- -- ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Help improve rule pt_PT
Hello! I want to improve the following rule: *** **** ** ** **há** **** **segundos?|minutos?|horas?|dias?|semanas?|mês|meses|anos?** **atrás** ** ** ** Com o verbo haver não é necessário usar "atrás": \1 \2 \3.** ** Há n segundos atrás.** *** I want the token after "há" to accept up to three words. For example: "há *vinte e três* dias atrás" What changes shall I make? Thanks! Kind regards, >Marco A.G.Pinto --- -- -- ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Possible English rule
Hello! I typed this on IRC hours ago: [22:13] just to tell you: before leaving to bed I found out an error in my course project... I as calling a function using -F instead of +F We could add a rule: I + AS + VERBGERUND -> I + WAS + VERBGERUND Is it a good rule? Thanks! Kind regards, >Marco A.G.Pinto --- -- -- ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Update check text on pt_PT
Hello! I haven't found the file that has the check text example, so I was wondering if someone could update the pt_PT one: *"Cola o teu próprio texto aqui... **ou**verifica este texto, **afim de**ver alguns **dos dos**problemas que o LanguageTool consegue detectar. Isto **tal vez**permita corrigir os teus erros **à última da hora**. A gente **vamos**estudar cada vez mais."** * This shows one more rule in use. Thanks! Kind regards, >Marco A.G.Pinto -- signature.asc Description: OpenPGP digital signature -- What NetFlow Analyzer can do for you? Monitors network bandwidth and traffic patterns at an interface-level. Reveals which users, apps, and protocols are consuming the most bandwidth. Provides multi-vendor support for NetFlow, J-Flow, sFlow and other flows. Make informed decisions using capacity planning reports.http://sdm.link/zohodev2dev___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Morphologic dictionary pt_PT
Hello! I have been reflecting and I have decided that after I finish my PhD (I hope to finish it this year) I will try to add an option to my "Proofing Tool GUI" software to extract .DICs into morphologic dictionaries (provided they have extra information per word). My idea was to grab both pt_PT dictionaries (prereform and postreform), extract them to morphologic and merge both to have pre+post reform working. This way we could always have up to date wordlists. The goal would be to do this in 2017. Is there any information/rules on how to do this task? Thanks! Kind regards, >Marco A.G.Pinto -- signature.asc Description: OpenPGP digital signature -- What NetFlow Analyzer can do for you? Monitors network bandwidth and traffic patterns at an interface-level. Reveals which users, apps, and protocols are consuming the most bandwidth. Provides multi-vendor support for NetFlow, J-Flow, sFlow and other flows. Make informed decisions using capacity planning reports.http://sdm.link/zohodev2dev___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Help creating rule pt_PT
Jaume, just *NCMS000 *creates lots of false positives: "Traduzir em sexo" I believe I really need to have the rule as I planned it. Do you have any suggestions or should I create two rules with "Latim" in one? Thanks! Kind regards, >Marco A.G.Pinto --- On 15/07/2016 11:54, Jaume Ortolà i Font wrote: > Hi, > > Most languages have the postag NCMS000, including "Latim". Try: > > * * > > Regards, > Jaume Ortolà > > 2016-07-15 12:45 GMT+02:00 Marco A.G.Pinto > <marcoagpi...@mail.telepac.pt <mailto:marcoagpi...@mail.telepac.pt>>: > > Hello! > > I am trying to create the following rule: > traduzir *em *LANG -> traduzir *para *LANG > (translate TO LANG) > > *** > **** > ** ** > **traduzir** > **** > **em** > **** > ** postag="AQ0MS0|latim"> ** > ** ** > ** Substituir «em» por > para.** > ** para** > ** Temos de traduzir > em Inglês.** > ** * > > It seems the language names are *AQ0MS0 *but "Latin" ("latim" in > Portuguese doesn't appear as *AQ0MS0*, so I added it by hand to > the rule) doesn't hit the rule while testing in the stand-alone tool. > > How do I fix it? > > Thanks! > > Kind regards, >>Marco A.G.Pinto > > > -- > -- -- What NetFlow Analyzer can do for you? Monitors network bandwidth and traffic patterns at an interface-level. Reveals which users, apps, and protocols are consuming the most bandwidth. Provides multi-vendor support for NetFlow, J-Flow, sFlow and other flows. Make informed decisions using capacity planning reports.http://sdm.link/zohodev2dev___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Help creating rule pt_PT
Hello! I am trying to create the following rule: traduzir *em *LANG -> traduzir *para *LANG (translate TO LANG) *** **** ** ** **traduzir** **** **em** **** ** ** ** ** ** Substituir «em» por para.** ** para** ** Temos de traduzir em Inglês.** ** * It seems the language names are *AQ0MS0 *but "Latin" ("latim" in Portuguese doesn't appear as *AQ0MS0*, so I added it by hand to the rule) doesn't hit the rule while testing in the stand-alone tool. How do I fix it? Thanks! Kind regards, >Marco A.G.Pinto -- signature.asc Description: OpenPGP digital signature -- What NetFlow Analyzer can do for you? Monitors network bandwidth and traffic patterns at an interface-level. Reveals which users, apps, and protocols are consuming the most bandwidth. Provides multi-vendor support for NetFlow, J-Flow, sFlow and other flows. Make informed decisions using capacity planning reports.http://sdm.link/zohodev2dev___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Unverified OpenPGP signature in LT mailing list
Hello! I noticed that my OpenPGP signed e-mails give an error in the signature due to the mailing list software that damages it. I person reported a similar issue in the Enigmail mailing list and they replied: *"Mailman (a mailing list software) in earlier versions had problems with PGP/MIME signatures. This list is using version 2.1.20 which is known to work. Can't talk about other software."* Daniel, is the software being used Mailman? If so, can it be updated? Thanks! Kind regards, >Marco A.G.Pinto --- -- signature.asc Description: OpenPGP digital signature -- Attend Shape: An AT Tech Expo July 15-16. Meet us at AT Park in San Francisco, CA to explore cutting-edge tech and listen to tech luminaries present their vision of the future. This family event has something for everyone, including kids. Get more information and register today. http://sdm.link/attshape___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Suggestion for English rule - masters thesis > master's thesis
Hello! Using MS Word 2016 I typed "blah blah my masters thesis" and Word suggested me to replace "masters" with "master's". Could this rule be added to LanguageTool? Thanks! Kind regards, >Marco A.G.Pinto --- -- signature.asc Description: OpenPGP digital signature -- Attend Shape: An AT Tech Expo July 15-16. Meet us at AT Park in San Francisco, CA to explore cutting-edge tech and listen to tech luminaries present their vision of the future. This family event has something for everyone, including kids. Get more information and register today. http://sdm.link/attshape___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
English rule: "Involved on/in"
Hello! Sometimes I commit the error: "I am involved in several projects". Maybe the "involved" can only be used with "on" or there are special cases? Could someone create this rule? Thanks! Kind regards, >Marco A.G.Pinto -- signature.asc Description: OpenPGP digital signature -- What NetFlow Analyzer can do for you? Monitors network bandwidth and traffic patterns at an interface-level. Reveals which users, apps, and protocols are consuming the most bandwidth. Provides multi-vendor support for NetFlow, J-Flow, sFlow and other flows. Make informed decisions using capacity planning reports. http://sdm.link/zohomanageengine___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Improving the rules from yesterday
Jaume, Thank you once more! It has worked! Kind regards from your friend, >Marco A.G.Pinto --- On 14/06/2016 10:46, Jaume Ortolà i Font wrote: > Try this: > > > > > É > > pois > [!.] > > Usar vírgula: \1, > Será verdade? É > pois! > > > Regards, > Jaume Ortolà > > > 2016-06-14 11:32 GMT+02:00 Marco A.G.Pinto > <marcoagpi...@mail.telepac.pt <mailto:marcoagpi...@mail.telepac.pt>>: > > Hello Jaume, > > I was wondering if you could help me. > > I wanted to improve yesterday's rule so that if someone types: > *É pois!* OR *É pois.* > it would only suggest a comma after the *É*. > For example: "*Será verdade? É, pois!*" > > I have added the following rules and the one failing is #3aa > (special case). > *** > **** > ** ** > ** ** > **é** > **pois** > **** > ** regexp="yes">[!.,;:–—\(]** > **** > ** ** > ** Usar vírgula: \1, > \2,** > ** Ela é pois > do Norte.** > ** ** > **** ** > **** > **** > **** > ** ** > ** ** > **É** > ** regexp="yes">[,;:–—\(]** > **** > **pois** > **!.** > ** ** > ** Usar vírgula: > \1,** > ** Será verdade? É > pois!** > ** ** > **** > **** > **** > **** > ** ** > ** ** > **é** > **,** > **pois** > **** > ** regexp="yes">[!.,;:–—\(]** > **** > ** ** > ** Usar vírgula: \1, > \3,** > ** Ela é, > pois do Norte.** > **** > **** > **** > **** > **** > ** ** > ** ** > ** é** > **pois** > **** > **,** > **** > ** ** > ** Usar vírgula: \1, > \2** > ** Ela é pois, > do Norte.** > *** > > > Could you tell me how to fix it? > > Thanks! > > Kind regards, > >Marco A.G.Pinto > > > > -- > -- signature.asc Description: OpenPGP digital signature -- What NetFlow Analyzer can do for you? Monitors network bandwidth and traffic patterns at an interface-level. Reveals which users, apps, and protocols are consuming the most bandwidth. Provides multi-vendor support for NetFlow, J-Flow, sFlow and other flows. Make informed decisions using capacity planning reports. https://ad.doubleclick.net/ddm/clk/305295220;132659582;e___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Need help creating rule
Thanks, it has worked! :-P Kind regards, >Marco A.G.Pinto --- On 13/06/2016 12:00, Jaume Ortolà i Font wrote: > Hi Marco, > > If you want to take into account every possibility (only one comma > present, no comma at all) and always give the proper suggestion, > you'll need to write different rules. > > One rule for: "é pois" > > > > > é > pois > > [,;:–—\(] > > Usar vírgula: \1, \2, > Ela é pois do > Norte. > > > One rule for: "é, pois" > > > > > é > , > pois > > [,;:–—\(] > > Usar vírgula: \1, \3, > Ela é, pois do > Norte. > > > One rule for: "é pois," > > > > > é > pois > > , > > Usar vírgula: \1, \2 > Ela é pois, do > Norte. > > > Even using antipattern, I think you can not do it with only one rule. > > Regards, > Jaume Ortolà > > > 2016-06-13 12:29 GMT+02:00 Marco A.G.Pinto > <marcoagpi...@mail.telepac.pt <mailto:marcoagpi...@mail.telepac.pt>>: > > Hello! > > I have been trying to create the following rule, but it gives an > error with "TESTRULES PT": > * > > > é > > [,;:–—\(]exception> > > pois > > [,;:–—\(]exception> > > > Usar vírgula: \1, \2, > Ela é pois do Norte. > * > > > The idea is to suggest commas in: > "É pois" -> "É, pois," > > How do I do it? > > Thanks! > > Kind regards, > >Marco A.G.Pinto > --- > > -- > -- signature.asc Description: OpenPGP digital signature -- What NetFlow Analyzer can do for you? Monitors network bandwidth and traffic patterns at an interface-level. Reveals which users, apps, and protocols are consuming the most bandwidth. Provides multi-vendor support for NetFlow, J-Flow, sFlow and other flows. Make informed decisions using capacity planning reports. https://ad.doubleclick.net/ddm/clk/305295220;132659582;e___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Please help test Chrome extension beta
I have done a clean installation of Chrome + the add-on on my testing machine and it seems to work fine (pt_PT). On 09/05/2016 21:37, Daniel Naber wrote: > On 2016-05-08 15:30, Daniel Naber wrote: > >>> please help test a new version of the Chrome extension: >> Here's an update, could you also test it, please? > To get more feedback, I've made testing easier: please remove the > existing LT extension and then install this and use it for a few days > and report the results: > > https://chrome.google.com/webstore/detail/languagetool-beta/fpgjchgblnekmbkfplkggjpaopgigngc > > As I mentioned, please also test this (in Chromium/Chrome) if you're a > Firefox user, this extension will soon also become our Firefox add-on. > Testing is really important, as the extension has more than 8000 users > and as soon as I upload the update to the web store, their extension > gets updates automatically. So if there's a serious bug, we'd have 8000 > affected users, all at the same day... > > Regards > Daniel > -- -- Mobile security can be enabling, not merely restricting. Employees who bring their own devices (BYOD) to work are irked by the imposition of MDM restrictions. Mobile Device Manager Plus allows you to control only the apps on BYO-devices by containerizing them, leaving personal data untouched! https://ad.doubleclick.net/ddm/clk/304595813;131938128;j___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Create rule - comma before word(s)
Thanks, Jan, it has worked! On 26/04/2016 20:33, Jan Schreiber wrote: > Hi Marco, > > Here's what I would try: > > > > > > > [,;:—\–\(] > > mas|porém|contudo|entretanto|todavia > > Usar vírgula: \1, > Eu gosto muito de > chocolate mas não posso comer para não engordar. > > > > > [,;:—\–\(] > > no > entanto > > Usar vírgula: \1, > Eu gosto muito de > chocolate no entanto não posso comer para não > engordar. > > > > > Am 26.04.2016 um 15:43 schrieb Marco A.G.Pinto: >> Hello! >> >> Sorry for not asking before but I have been very busy and completely >> forgot about it. >> >> How do I make it work with: >> >> >> >> >> [,;:–—\(] >> >> mas|porém|contudo|entretanto|todavia >> >> >> Usar vírgula: \1, >> Pensa primeiro, ou >> seja escolhe acertadamente. >> >> >> Also, with the words: "no entanto"? >> >> Thanks! >> >> Kind regards, >> >Marco A.G.Pinto >>--- >> >> On 03/03/2016 19:03, Marcin Miłkowski wrote: >>> W dniu 03.03.2016 o 19:06, Marco A.G.Pinto pisze: >>>> Hello! >>>> >>>> The following Portuguese words require a comma before them: >>>> 1) Eu gosto muito de chocolate, *mas *não posso comer para não engordar. >>>> 2) Eu gosto muito de chocolate, *porém *não posso comer para não engordar. >>>> 3) Eu gosto muito de chocolate, *contudo *não posso comer para não >>>> engordar. >>>> 4) Eu gosto muito de chocolate, *no entanto* não posso comer para não >>>> engordar. >>>> 5) Eu gosto muito de chocolate, *entretanto *não posso comer para não >>>> engordar. >>>> 6) Eu gosto muito de chocolate, *todavia *não posso comer para não >>>> engordar. >>>> >>>> >>>> Is there a simple way of adapting the rule Yakov helped me with the >>>> other day?: >>>> * >>>> >>>> >>>> >>>> ou >>>> seja >>>> >>>> , >>>> >>>>Usar vírgula: \1 \2, >>>>Pensa primeiro, ou >>>> seja escolhe acertadamente. >>>> * >>>> >>>> >>>> PS-> Notice that in example 4) it is two words, not one. >>> Actually, I would write it in a slightly more general way. >>> >>> >>> [,;:–—\(] >>> >>> no >>> entanto >>> >>> >>> Why? Just because one usually use an opening parenthesis, a colon etc. >>> instead of a comma in some context. And exceptions have a slightly >>> different logic than negation, which may become tricky in some >>> situations with regular expressions (probably not here, though). >>> >>> Best, >>> Marcin >>> >>>> Thanks! >>>> >>>> Kind regards, >>>> >Marco A.G.Pinto >>>> --- >>>> >>>> -- >>>> >> -- -- -- Find and fix application performance issues faster with Applications Manager Applications Manager provides deep performance insights into multiple tiers of your business applications. It resolves application problems quickly and reduces your MTTR. Get your free trial! https://ad.doubleclick.net/ddm/clk/302982198;130105516;z___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Proofing Tool GUI vs LanguageTool
Hello! I came up with a brilliant idea for my Hunspell tool: I want to add support for LanguageTool, so that instead of editing a XML file, we could have a GUI for it. But, it is better to dedicate my time to the PhD project and leave this to a later time. This makes me dream and I hope that I finish the course this year and then will have the desirable free time to all my projects. Kind regards, >Marco A.G.Pinto --- -- -- Find and fix application performance issues faster with Applications Manager Applications Manager provides deep performance insights into multiple tiers of your business applications. It resolves application problems quickly and reduces your MTTR. Get your free trial! https://ad.doubleclick.net/ddm/clk/302982198;130105516;z___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Error using "TESTRULES PT"
Hello! I don't know what happened, but the command in the subject in the stand-alone tool, which I use to test the rules, gives an error: How can it be fixed? Thanks! Kind regards, >Marco A.G.Pinto --- -- signature.asc Description: OpenPGP digital signature -- Find and fix application performance issues faster with Applications Manager Applications Manager provides deep performance insights into multiple tiers of your business applications. It resolves application problems quickly and reduces your MTTR. Get your free trial! https://ad.doubleclick.net/ddm/clk/302982198;130105516;z___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Automatic Dictionary Switcher
Hello! Has this add-on been removed permanently from Mozilla's pages? What is its status? Thanks! Kind regards, >Marco A.G.Pinto --- -- signature.asc Description: OpenPGP digital signature -- Transform Data into Opportunity. Accelerate data analysis in your applications with Intel Data Analytics Acceleration Library. Click to learn more. http://pubads.g.doubleclick.net/gampad/clk?id=278785351=/4140___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Create rule - comma before word(s)
Hello! The following Portuguese words require a comma before them: 1) Eu gosto muito de chocolate, *mas *não posso comer para não engordar. 2) Eu gosto muito de chocolate, *porém *não posso comer para não engordar. 3) Eu gosto muito de chocolate, *contudo *não posso comer para não engordar. 4) Eu gosto muito de chocolate, *no entanto* não posso comer para não engordar. 5) Eu gosto muito de chocolate, *entretanto *não posso comer para não engordar. 6) Eu gosto muito de chocolate, *todavia *não posso comer para não engordar. Is there a simple way of adapting the rule Yakov helped me with the other day?: * ou seja , Usar vírgula: \1 \2, Pensa primeiro, ou seja escolhe acertadamente. * PS-> Notice that in example 4) it is two words, not one. Thanks! Kind regards, >Marco A.G.Pinto --- -- -- Site24x7 APM Insight: Get Deep Visibility into Application Performance APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month Monitor end-to-end web transactions and take corrective actions now Troubleshoot faster and improve end-user experience. Signup Now! http://pubads.g.doubleclick.net/gampad/clk?id=272487151=/4140___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: please introduce category IDs
Daniel, Does this mean I should do a replacement like this: *** **** ** ** **à** **bocado** ** ** ** Substituir «à bocado» por há bocado.** ** Cheguei à bocado do Algarve.** *** Replacing: ** with: ** ? Is this the correct procedure? Thanks! Kind regards, >Marco A.G.Pinto On 01/03/2016 13:13, Daniel Naber wrote: Hi, I've added new options to the command-line LT so users can turn on/off categories (--enablecategories, --disablecategories). For this, I've introduced an 'id' attribute to s, just like for s. If you maintain a language, please add id="..." to all categories, e.g. like this: There's no static set of categories, but I still suggest you try to use these and only come up with your own in case these don't cover your categories: https://github.com/languagetool-org/languagetool/blob/master/languagetool-core/src/main/java/org/languagetool/rules/CategoryIds.java#L27-L44 I suggest we use all uppercase names and no special characters, just like for the rule ids. Currently, these ids are optional, but I'd like to make them non-optional to avoid confusion. Also, please add category ids for your Java rules. The code in the constructor changes like this: super.setCategory(new Category(messages.getString("category_grammar"))); => super.setCategory(new Category(Categories.MISC, messages.getString("category_grammar"))); Regards Daniel -- -- Site24x7 APM Insight: Get Deep Visibility into Application Performance APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month Monitor end-to-end web transactions and take corrective actions now Troubleshoot faster and improve end-user experience. Signup Now! http://pubads.g.doubleclick.net/gampad/clk?id=272487151=/4140___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Adding comma in rules
Hello! How do I make LanguageTool suggest commas in rules? I was just working on my thesis and when I wrote: "ou seja blah blah" Microsoft Word 2016 suggested: "ou seja," (with a comma). Thanks! Kind regards, >Marco A.G.Pinto --- -- -- Site24x7 APM Insight: Get Deep Visibility into Application Performance APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month Monitor end-to-end web transactions and take corrective actions now Troubleshoot faster and improve end-user experience. Signup Now! http://pubads.g.doubleclick.net/gampad/clk?id=272487151=/4140___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Problem creating rule
Thank you my friend, I have committed the grammar.xml with the improvements. Kind regards, >Marco A.G.Pinto --- On 16/02/2016 20:01, Yakov Reztsov wrote: Hello Marco, Problem of this rule that word * "bonita" is not match to "<*token postag_regexp="yes" postag="AQ0CS0|PI0FS000|PI0MS000|NCMS000">*"* You can exchange postag "NCMS000" and "NCFS000" on second token like this: I think this rules may look like this: * ** **** ** ** **** **um** **** **postag="AQ0CS0|PI0FS000|PI0MS000|*NCFS000 *">** **postag="NCMS000">uma** ** ** ** Erro de concordância um/uma.** ** uma** ** Quero um bonita foto dela.** ** ** ** ** **** **** ** ** **** **uma** **** **postag="AQ0CS0|PI0FS000|PI0MS000|*NCMS000*">** **postag="NCFS000">a** ** ** ** Erro de concordância uma/um.** ** um** ** Quero uma bonito carro novo.** ** * Вторник, 16 февраля 2016, 14:15 UTC от "Marco A.G.Pinto" <marcoagpi...@mail.telepac.pt>: Hello! Only today I had the chance of dedicating some time to improve the complex rule: "um/uma + adj + noun" (male/female). But it is giving an error in TESTRULES PT. What am I doing wrong?:* **** **** ** ** **** **um** **** **** **uma** ** ** ** Erro de concordância um/uma.** ** uma** ** Quero um bonita foto dela.** ** ** ** ** **** **** ** ** **** **uma** **** **** **a** ** ** ** Erro de concordância uma/um.** ** um** ** Quero uma bonito carro novo.** ** * Thanks! Kind regards, >Marco A.G.Pinto --- -- -- Yakov Reztsov -- -- Site24x7 APM Insight: Get Deep Visibility into Application Performance APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month Monitor end-to-end web transactions and take corrective actions now Troubleshoot faster and improve end-user experience. Signup Now! http://pubads.g.doubleclick.net/gampad/clk?id=272487151=/4140___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Problem creating rule
Hello! Only today I had the chance of dedicating some time to improve the complex rule: "um/uma + adj + noun" (male/female). But it is giving an error in TESTRULES PT. What am I doing wrong?:* **** **** ** ** **** **um** **** **postag="AQ0CS0|PI0FS000|PI0MS000|NCMS000">** **postag="NCMS000">uma** ** ** ** Erro de concordância um/uma.** ** uma** ** Quero um bonita foto dela.** ** ** ** ** **** **** ** ** **** **uma** **** **postag="AQ0CS0|PI0FS000|PI0MS000|NCFS000">** **postag="NCFS000">a** ** ** ** Erro de concordância uma/um.** ** um** ** Quero uma bonito carro novo.** ** * Thanks! Kind regards, >Marco A.G.Pinto --- -- -- Site24x7 APM Insight: Get Deep Visibility into Application Performance APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month Monitor end-to-end web transactions and take corrective actions now Troubleshoot faster and improve end-user experience. Signup Now! http://pubads.g.doubleclick.net/gampad/clk?id=272487151=/4140___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Rule not working properly
Hello! I have been up since 6am trying to fix my last two rules before going to my weekend job at around noon. This is the status: *** **** ** ** **** **um** **** **postag="AQ0CS0|PI0FS000|NCFS000">uma** **postag="NCMS000">** ** ** ** Erro de concordância um/uma.** ** uma** ** Quero um bonita foto dela.** ** * The above rule with the "uma" exception fixed: "A posse dos resultados do trabalho de cada um uma vez que" *** **** ** ** **** **uma** **** **postag="AQ0CS0|PI0MS000|NCMS000">a** **postag="NCFS000">** ** ** ** Erro de concordância uma/um.** ** um** ** Quero uma bonito carro novo.** ** ** * The above rule still triggers "a" even if it is an exception: "A região é subdividida em duas zonas, uma a Ocidente" What is wrong with the second rule? Thanks! Kind regards, >Marco A.G.Pinto --- -- -- Site24x7 APM Insight: Get Deep Visibility into Application Performance APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month Monitor end-to-end web transactions and take corrective actions now Troubleshoot faster and improve end-user experience. Signup Now! http://pubads.g.doubleclick.net/gampad/clk?id=272487151=/4140___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Testing rules against corpus
Hello! Is there a way of testing rules to get the same results as the night diff? I mean, all the hits that rules cause? I am not 100% sure but in the past it was possible somehow? I want to improve even more my last two rules but I can't be waiting all the time for the night results e-mail. Today and tomorrow I will be at my weekend job and won't have the chance to do much. Thanks! Kind regards, >Marco A.G.Pinto -- -- Site24x7 APM Insight: Get Deep Visibility into Application Performance APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month Monitor end-to-end web transactions and take corrective actions now Troubleshoot faster and improve end-user experience. Signup Now! http://pubads.g.doubleclick.net/gampad/clk?id=272487151=/4140___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: new feature: show examples on languagetool.org
Hello Daniel, The two strings translated to pt_PT: "Exemplos..." "Desculpa, não encontrámos exemplos para esta regra." Kind regards, >Marco A.G.Pinto --- On 20/01/2016 16:50, Daniel Naber wrote: Hi, I've added a new feature on https://languagetool.org: in the menu of every error you can now open a dialog that shows some examples of the error. Note that a few rules don't have examples (Java rules - the XML rules should all have at least one) and we don't show a correction if there's more than one suggestion as maybe they're not all correct. If you want to translate the menu item and the "sorry, no examples" message to your language you can do so here: https://github.com/languagetool-org/languagetool-website/blob/master/include/header.php#L294-L306 ...or just send me the two strings. Regards Daniel -- -- Site24x7 APM Insight: Get Deep Visibility into Application Performance APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month Monitor end-to-end web transactions and take corrective actions now Troubleshoot faster and improve end-user experience. Signup Now! http://pubads.g.doubleclick.net/gampad/clk?id=267308311=/4140___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Help create rule (2) (more details)
Yakov, I have just downloaded the latest stand-alone tool but it still gives an error in *TESTRULES PT* probably because the files you updated last night aren't still in the .ZIP . So, I will wait until 10pm of today (Monday) to download the latest stand-alone. Thank you very much. Kind regards, >Marco A.G.Pinto --- On 18/01/2016 06:46, Yakov Reztsov wrote: Now you can simply add this new rule to grammar.xml a Erro de concordância do plural: no="2" postag_regexp= "yes" postag="NCFP000" postag_replace="NCFS000"/> A vacas come no pasto. Воскресенье, 17 января 2016, 21:30 UTC от "Marco A.G.Pinto" <marcoagpi...@mail.telepac.pt>: Thanks, Yakov, once again, Silly (but important) question: how do I make the changes? :-P Kind regards, >Marco A.G.Pinto --- On 17/01/2016 21:17, Yakov Reztsov wrote: Hello Marco! We can use this way: Erro de concordância do plural: But for this syntax we need make changes in java code of Portuguese module. I commit this changes to Github now. Воскресенье, 17 января 2016, 20:21 UTC от "Marco A.G.Pinto" Hello! I was making some tests and it seems that in the suggestion it simply removes the last "s". This means it won't work with special nouns such as: flor -> flores cão -> cães Is there a way of replacing: *Erro de concordância do plural: * with just saying: "use plural" or "use singular" in the suggestion? Thanks! Kind regards, >Marco A.G.Pinto --- On 17/01/2016 19:04, Marco A.G.Pinto wrote: Yakov, It has worked! Thanks! What if I want to do the opposite now?: "As vaca comem no pasto" to "As vacas comem no pasto" What should I write in: Erro de concordância do plural: Thanks! If all goes well, tonight I will release these rules, which are very important. noun singular -> noun plural noun plural -> noun singular Two rules for female PLUS two rules for male ( A + AS) (O + OS). Thank you very much! Kind regards from your friend, >Marco A.G.Pinto --- On 17/01/2016 18:32, Yakov Reztsov wrote: Hello, I tested this rule on community.languagetool.org, and it no display any error. But *TESTRULES PT* displays error if is not specified. That is full example: a Erro de concordância do plural: A vacas come no pasto. Воскресенье, 17 января 2016, 12:26 UTC от "Marco A.G.Pinto" : Hello! I was messaging Yakov in private but decided to send it to the ML with more details. I was trying to create the following rule: "A vacas come no pasto" So that it would suggest: "A vaca come no pasto" "vacas" is a female plural noun and the error is to tell people to change the noun to singular "vaca" because it has a "a" (singular) before and a 3rd person singular verb after it. But, it must work with all female nouns, thus the ** . There must a way of highlighting the plural noun and tell people to change it to singular. This is what I am trying to do. Based on this rule, I can then create other kind of important rules. The rule gives an error with *TESTRULES PT*: a Erro de concordância do plural. A vaca come no pasto. How can I fix it? Thanks! Kind regards, >Marco A.G.Pinto -- -- -- Yakov Reztsov -- -- -- Yakov Reztsov -- -- Yakov Reztso -- -- Site24x7 APM Insight: Get Deep Visibility into Application Per
Re: Help create rule (2) (more details)
Hello! I was making some tests and it seems that in the suggestion it simply removes the last "s". This means it won't work with special nouns such as: flor -> flores cão -> cães Is there a way of replacing: *Erro de concordância do plural: regexp_match="(.*)(s)" regexp_replace="$1"/>* with just saying: "use plural" or "use singular" in the suggestion? Thanks! Kind regards, >Marco A.G.Pinto --- On 17/01/2016 19:04, Marco A.G.Pinto wrote: Yakov, It has worked! Thanks! What if I want to do the opposite now?: "As vaca comem no pasto" to "As vacas comem no pasto" What should I write in: Erro de concordância do plural: regexp_match="(.*)(s)" regexp_replace="$1"/> Thanks! If all goes well, tonight I will release these rules, which are very important. noun singular -> noun plural noun plural -> noun singular Two rules for female PLUS two rules for male ( A + AS) (O + OS). Thank you very much! Kind regards from your friend, >Marco A.G.Pinto --- On 17/01/2016 18:32, Yakov Reztsov wrote: Hello, I tested this rule on community.languagetool.org, and it no display any error. But *TESTRULES PT* displays error if is not specified. That is full example: a Erro de concordância do plural: no="2" regexp_match="(.*)(s)" regexp_replace="$1"/> A vacas come no pasto. Воскресенье, 17 января 2016, 12:26 UTC от "Marco A.G.Pinto" : Hello! I was messaging Yakov in private but decided to send it to the ML with more details. I was trying to create the following rule: "A vacas come no pasto" So that it would suggest: "A vaca come no pasto" "vacas" is a female plural noun and the error is to tell people to change the noun to singular "vaca" because it has a "a" (singular) before and a 3rd person singular verb after it. But, it must work with all female nouns, thus the ** . There must a way of highlighting the plural noun and tell people to change it to singular. This is what I am trying to do. Based on this rule, I can then create other kind of important rules. The rule gives an error with *TESTRULES PT*: a Erro de concordância do plural. A vaca come no pasto. How can I fix it? Thanks! Kind regards, >Marco A.G.Pinto -- -- -- Yakov Reztsov -- -- -- Site24x7 APM Insight: Get Deep Visibility into Application Performance APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month Monitor end-to-end web transactions and take corrective actions now Troubleshoot faster and improve end-user experience. Signup Now! http://pubads.g.doubleclick.net/gampad/clk?id=267308311=/4140___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Help create rule (2) (more details)
Thanks, Yakov, once again, Silly (but important) question: how do I make the changes? :-P Kind regards, >Marco A.G.Pinto --- On 17/01/2016 21:17, Yakov Reztsov wrote: Hello Marco! We can use this way: Erro de concordância do plural: no="2" postag_regexp= "yes" postag="NCFP000" postag_replace="NCFS000"/> But for this syntax we need make changes in java code of Portuguese module. I commit this changes to Github now. Воскресенье, 17 января 2016, 20:21 UTC от "Marco A.G.Pinto" Hello! I was making some tests and it seems that in the suggestion it simply removes the last "s". This means it won't work with special nouns such as: flor -> flores cão -> cães Is there a way of replacing: *Erro de concordância do plural: * with just saying: "use plural" or "use singular" in the suggestion? Thanks! Kind regards, >Marco A.G.Pinto --- On 17/01/2016 19:04, Marco A.G.Pinto wrote: Yakov, It has worked! Thanks! What if I want to do the opposite now?: "As vaca comem no pasto" to "As vacas comem no pasto" What should I write in: Erro de concordância do plural: Thanks! If all goes well, tonight I will release these rules, which are very important. noun singular -> noun plural noun plural -> noun singular Two rules for female PLUS two rules for male ( A + AS) (O + OS). Thank you very much! Kind regards from your friend, >Marco A.G.Pinto --- On 17/01/2016 18:32, Yakov Reztsov wrote: Hello, I tested this rule on community.languagetool.org, and it no display any error. But *TESTRULES PT* displays error if is not specified. That is full example: a Erro de concordância do plural: A vacas come no pasto. Воскресенье, 17 января 2016, 12:26 UTC от "Marco A.G.Pinto" : Hello! I was messaging Yakov in private but decided to send it to the ML with more details. I was trying to create the following rule: "A vacas come no pasto" So that it would suggest: "A vaca come no pasto" "vacas" is a female plural noun and the error is to tell people to change the noun to singular "vaca" because it has a "a" (singular) before and a 3rd person singular verb after it. But, it must work with all female nouns, thus the ** . There must a way of highlighting the plural noun and tell people to change it to singular. This is what I am trying to do. Based on this rule, I can then create other kind of important rules. The rule gives an error with *TESTRULES PT*: a Erro de concordância do plural. A vaca come no pasto. How can I fix it? Thanks! Kind regards, >Marco A.G.Pinto -- -- -- Yakov Reztsov -- -- -- Yakov Reztsov -- -- Site24x7 APM Insight: Get Deep Visibility into Application Performance APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month Monitor end-to-end web transactions and take corrective actions now Troubleshoot faster and improve end-user experience. Signup Now! http://pubads.g.doubleclick.net/gampad/clk?id=267308311=/4140___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Help create rule (2) (more details)
Yakov, It has worked! Thanks! What if I want to do the opposite now?: "As vaca comem no pasto" to "As vacas comem no pasto" What should I write in: Erro de concordância do plural: regexp_match="(.*)(s)" regexp_replace="$1"/> Thanks! If all goes well, tonight I will release these rules, which are very important. noun singular -> noun plural noun plural -> noun singular Two rules for female PLUS two rules for male ( A + AS) (O + OS). Thank you very much! Kind regards from your friend, >Marco A.G.Pinto --- On 17/01/2016 18:32, Yakov Reztsov wrote: Hello, I tested this rule on community.languagetool.org, and it no display any error. But *TESTRULES PT* displays error if is not specified. That is full example: a Erro de concordância do plural: no="2" regexp_match="(.*)(s)" regexp_replace="$1"/> A vacas come no pasto. Воскресенье, 17 января 2016, 12:26 UTC от "Marco A.G.Pinto" : Hello! I was messaging Yakov in private but decided to send it to the ML with more details. I was trying to create the following rule: "A vacas come no pasto" So that it would suggest: "A vaca come no pasto" "vacas" is a female plural noun and the error is to tell people to change the noun to singular "vaca" because it has a "a" (singular) before and a 3rd person singular verb after it. But, it must work with all female nouns, thus the ** . There must a way of highlighting the plural noun and tell people to change it to singular. This is what I am trying to do. Based on this rule, I can then create other kind of important rules. The rule gives an error with *TESTRULES PT*: a Erro de concordância do plural. A vaca come no pasto. How can I fix it? Thanks! Kind regards, >Marco A.G.Pinto -- -- -- Yakov Reztsov -- -- Site24x7 APM Insight: Get Deep Visibility into Application Performance APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month Monitor end-to-end web transactions and take corrective actions now Troubleshoot faster and improve end-user experience. Signup Now! http://pubads.g.doubleclick.net/gampad/clk?id=267308311=/4140___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Help create rule (2) (more details)
Hello! I was messaging Yakov in private but decided to send it to the ML with more details. I was trying to create the following rule: "A vacas come no pasto" So that it would suggest: "A vaca come no pasto" "vacas" is a female plural noun and the error is to tell people to change the noun to singular "vaca" because it has a "a" (singular) before and a 3rd person singular verb after it. But, it must work with all female nouns, thus the *postag="NCFP000">* . There must a way of highlighting the plural noun and tell people to change it to singular. This is what I am trying to do. Based on this rule, I can then create other kind of important rules. The rule gives an error with *TESTRULES PT*: a Erro de concordância do plural. A vaca come no pasto. How can I fix it? Thanks! Kind regards, >Marco A.G.Pinto -- -- -- Site24x7 APM Insight: Get Deep Visibility into Application Performance APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month Monitor end-to-end web transactions and take corrective actions now Troubleshoot faster and improve end-user experience. Signup Now! http://pubads.g.doubleclick.net/gampad/clk?id=267308311=/4140___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Help create rule
Hello! I was trying to create the following rule: "A vacas come no pasto" a Erro de concordância do plural. A vaca come no pasto. I want LanguageTool to highlight NCFP000 as being incorrect but TESTRULES PT gives an error. How can I fix it? Thanks! Kind regards, >Marco A.G.Pinto -- -- -- Site24x7 APM Insight: Get Deep Visibility into Application Performance APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month Monitor end-to-end web transactions and take corrective actions now Troubleshoot faster and improve end-user experience. Signup Now! http://pubads.g.doubleclick.net/gampad/clk?id=267308311=/4140___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: user support in our forum
Done! On 11/01/2016 21:00, Daniel Naber wrote: Hi, our new forum seems to be working fine. However, one problem of the old forum seems to remain: I'm almost the only one who replies to user questions. So I ask all LT contributors to subscribe to the forum so they get forum posts by email and reply to questions, if possible (don't worry, the forum is still low traffic). Here's how to subscribe: -Login in at http://forum.languagetool.org -click your user icon -click the gear icon to visit your settings page -scroll down to "Email" and activate "Send me an email for every new post (unless I mute the topic or category)" Regards Daniel -- -- Site24x7 APM Insight: Get Deep Visibility into Application Performance APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month Monitor end-to-end web transactions and take corrective actions now Troubleshoot faster and improve end-user experience. Signup Now! http://pubads.g.doubleclick.net/gampad/clk?id=267308311=/4140___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Issue in rule
Hello! I have been trying to create the following rule: *** **** ** ** ** ** **o** **anos|onus|ânus ** **da** **prova** ** ** ** ** ** Usar «o ónus da prova»** ** o anos da prova.** ** o ónus da prova. ** *** After "TESTRULES PT" I tried to test it in the stand alone tool, it suggests the correction but when I click in the suggestion, it does nothing. What am I doing wrong? Thanks! Kind regards, >Marco A.G.Pinto -- -- ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Error creating a pt_PT rule
Hello! I tried to create the following rule: *** **** ** ** **** **nos ** **** ** ** ** ** ** Substituir «nos» por nós.** ** Todos nos sabemos. ** ** * But when I type: "testrules pt" on Windows CLI it says: "blah blah The Portuguese rule: NOS_VERBO[1] (POS tag), token [2], contains "VMIP1P0" that is marked as regular expression but probably is not." How do I fix this? Thanks! Kind regards, >Marco A.G.Pinto -- -- Go from Idea to Many App Stores Faster with Intel(R) XDK Give your users amazing mobile app experiences with Intel(R) XDK. Use one codebase in this all-in-one HTML5 development environment. Design, debug & build mobile apps & 2D/3D high-impact games for multiple OSs. http://pubads.g.doubleclick.net/gampad/clk?id=254741911=/4140___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Error creating a pt_PT rule
Thanks, it worked! On 03/12/2015 13:28, Mike Unwalla wrote: Remove postag_regexp="yes" The postag VMIP1P0 is not a regular expression. Regards, Mike Unwalla Contact: www.techscribe.co.uk/techw/contact.htm -Original Message----- From: Marco A.G.Pinto [mailto:marcoagpi...@mail.telepac.pt] Sent: 03 December 2015 12:51 To: Mailing List - LanguageTool Subject: Error creating a pt_PT rule Hello! I tried to create the following rule: nos Substituir «nos» por nós. Todos nos sabemos. But when I type: "testrules pt" on Windows CLI it says: "blah blah The Portuguese rule: NOS_VERBO[1] (POS tag), token [2], contains "VMIP1P0" that is marked as regular expression but probably is not." How do I fix this? Thanks! Kind regards, >Marco A.G.Pinto -- -- Go from Idea to Many App Stores Faster with Intel(R) XDK Give your users amazing mobile app experiences with Intel(R) XDK. Use one codebase in this all-in-one HTML5 development environment. Design, debug & build mobile apps & 2D/3D high-impact games for multiple OSs. http://pubads.g.doubleclick.net/gampad/clk?id=254741911=/4140___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Chrome landing page
It looks very good! :-P Kind regards, >Marco A.G.Pinto --- On 20/11/2015 17:20, Daniel Naber wrote: Hi, please check out the page I've added for our Chrome extension: https://languagetool.org/chrome/ It's not linked yet, so we can fix errors if there are any. I'm planning to build the same for Firefox, of course. The idea of a landing page is that people can better find us with search engines if they're searching for topics more specific than "grammar checking". Plus, it provides more detailed information to users who haven't decided yet whether to install the extension. Regards Daniel -- -- ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Better pt_PT in Firefox add-on
Hello Daniel and all, The other day I reformatted my 10,1'' netbook (Windows 10 in it) which is my test computer for pt_PT and not only. I tested there the Chrome add-on and yesterday I gave a look to the Firefox add-on. I found out that I need to improve pt_PT add-on for FF and make a few fixes. Since the new LT update will be at the end of the year, does that mean I have time to do all the fixes and that they will be implemented? Thanks! Kind regards, >Marco A.G.Pinto -- -- ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Chrome extension translation
Daniel, I have done a couple of fixes in pt_PT. However, I don't know how to see all the translation sentences in Chrome. Kind regards, >Marco A.G.Pinto On 09/11/2015 12:53, Daniel Naber wrote: On 2015-11-08 14:50, Juan Martorell wrote: and German. It's just 28 strings, most of them quite short so I estimate it takes less than half an hour. Done; 19 minutes. thanks, a new version (0.8.5) is now online with these new translations: ru, it, pt_PT, sl, es Please test your translations: https://chrome.google.com/webstore/detail/languagetool/oldceeleldhonbafppcapldpdifcinji Regards Daniel -- -- Presto, an open source distributed SQL query engine for big data, initially developed by Facebook, enables you to easily query your data on Hadoop in a more interactive manner. Teradata is also now providing full enterprise support for Presto. Download a free open source copy now. http://pubads.g.doubleclick.net/gampad/clk?id=250295911=/4140___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Chrome extension translation
Daniel, it is done. However I have a question on the last sentence: "Usage: Select the text that you want to check, or click in a text field if you want to check its contents. Then click on the LT icon, which is displayed in the upper right corner or your browser. Alternatively, use Ctrl+Shift+Space as a shortcut" Is it "or" or "on"? Thanks! Kind regards, >Marco A.G.Pinto --- On 05/11/2015 10:22, Daniel Naber wrote: Hi, the Chrome extension is now ready for translation at Transifex. Please help translate it so the next release can support more than just English and German. It's just 28 strings, most of them quite short so I estimate it takes less than half an hour. Regards Daniel -- -- ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Compound words for en_GB + pt_PT
On 29/08/2015 09:53, Daniel Naber wrote: On 2015-08-28 10:50, Marco A.G.Pinto wrote: I have used Proofing Tool GUI to extract all the compounds from the British and Portuguese dictionaries: - en_GB - 5390 compounds https://dl.dropboxusercontent.com/u/30674540/en_gb_compound_words_5390words_20150828.txt I think a native speaker should comment on that. I'm not sure if English is really that strict with its compounds, i.e. if all those words really *have* to be written with a dash (and not without, or as two words). You are right, a native speaker should comment on it :-P I also extracted the compounds of pt_PT and it gave a 10 MB file with 648489 words. Can I add it to the Portuguese compounds words .txt or is it just too big? I'm not sure... is the rule really so useful that it's worth it? Does it create false alarms? It is possible that some false alarms may occur. I guess when I have some free time I will try to check the .txt word by word and start adding to LT the ones that will cause no alarms. Kind regards from your friend, Marco A.G.Pinto --- -- -- ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Compound words for en_GB + pt_PT
Update: I have just added * at the end of each word to make it compliant with the way LanguageTool works. On 28/08/2015 09:50, Marco A.G.Pinto wrote: Hello! I have used Proofing Tool GUI to extract all the compounds from the British and Portuguese dictionaries: - en_GB - 5390 compounds https://dl.dropboxusercontent.com/u/30674540/en_gb_compound_words_5390words_20150828.txt - pt_PT - 648489 compounds https://dl.dropboxusercontent.com/u/30674540/pt_PT_compound_words_648489words_20150828.txt My question is if the en_GB ones could be added to the bottom of English compounds .txt by the official maintainer with: *# Compounds en_GB - 2015-08-28* before the list. The reason for the # is because there are probably some missing and incorrect words there and, as I add words to en_GB, I fix here and there. I also extracted the compounds of pt_PT and it gave a 10 MB file with 648489 words. Can I add it to the Portuguese compounds words .txt or is it just too big? I will also add a # just like for en_GB because as the dictionary is improved, there will be fixes and new compounds. Thanks! Kind regards, Marco A.G.Pinto --- -- -- -- ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Question about creating a rule (Portuguese)
Hello! I want to create the rule: CAPTURAR imagens - CAPTAR imagens (capture images) But I wanted to allow to a word to be placed or not between both words. CAPTURAR AS imagens CAPTURAR VÁRIAS imagens etc. How do I create it? I have written: *!-- CAPTURAR IMAGENS captar imagens --** **rule id=CAPTURAR_IMAGENS name=captar imagens** ** pattern** **marker ** **tokencapturar/token** **/marker** **token/token** **tokenimagens/token** ** /pattern** ** messageSubstituir «capturar» por suggestioncaptar/suggestion./message** ** example correction=captarVamos markercapturar/marker imagens./example** **/rule* Thanks! Kind regards, Marco A.G.Pinto --- -- -- ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Fwd: Re[2]: Fwd: Re: pt_PT - spelling.txt
Thanks, Yakov and Daniel! Kind regards, Marco A.G.Pinto - On 29/07/2015 21:46, Yakov Reztsov wrote: Hi! Вторник, 28 июля 2015, 22:11 +02:00 от Daniel Naber : On 2015-07-28 21:44, Yakov Reztsov wrote: Step to reproduce: 1. Computer: Linux 32-bit, Java 7 2. LanguageTool-20150726-snapshot.zip 3. java -jar languagetool.jar 4. Language is already set to English or Russian. 5. User select language to Portuguese. 6. A window appears with an error. I cannot reproduce this on Java 1.7.0_51 (64bit). But looking for the error message I found it could be caused by calling shutdown(). Can you see if commenting out languageTool.shutdown(); in LanguageToolSupport fixes it? Yes, It fixes this error. -- Yakov Reztsov -- -- ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
pt_PT - spelling.txt
Hello! I have been sending e-mails to Yakov asking for help and I am thankful for all his help. The replace.txt has non-Portuguese words I added and they appear in the stand-alone tool as: 1) typo 2) replace 3) rule in grammar.xml I have just added to spelling.txt the words to extend the dictionary and now I only need to remove them from 3). However, the word New York produces lots of errors in the stand-alone since it says it has a space. Is there something that can be fixed in the developers side or should I leave out words with spaces? Thanks! Kind regards, Marco A.G.Pinto --- -- -- Don't Limit Your Business. Reach for the Cloud. GigeNET's Cloud Solutions provide you with the tools and support that you need to offload your IT needs and focus on growing your business. Configured For All Businesses. Start Your Cloud Today. https://www.gigenetcloud.com/___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: pt_PT - Added: replace.txt
Done! What about now? Is it okay to make it work? Thanks! Kind regards, Marco A.G.Pinto -- On 14/07/2015 13:30, Yakov Reztsov wrote: Нi! You need to replace all the strings en. to pt.. Вторник, 14 июля 2015, 12:21 +01:00 от Marco A.G.Pinto t: Hi?! Has anyone received my e-mail with the file? I haven't received an answer so far :'( Thanks! Kind regards, Marco A.G.Pinto - On 13/07/2015 14:26, Marco A.G.Pinto wrote: Hello! I have edited the EnglishReplaceRule.java I have changed the text but kept the original filenames and such since I didn't know what to replace there. In case the attachment is lost, here is in Dropbox: https://dl.dropboxusercontent.com/u/30674540/PortugueseReplaceRule.java Daniel, can you or someone make it available to pt_PT and make the filename changes? Thanks! Kind regards, Marco A.G.Pinto -- On 11/07/2015 09:39, Daniel Naber wrote: On 2015-07-10 20:18, Marco A.G.Pinto wrote: Could someone activate it? After it is activated, I will remove them from grammar.xml . We'll need some strings for that rule: -Message -Description -Short description -Two examples, one incorrect and one correct Also see https://github.com/languagetool-org/languagetool/blob/master/languagetool-language-modules/pl/src/main/java/org/languagetool/rules/pl/SimpleReplaceRule.java for examples. Regards Daniel Yakov Reztsov -- /* LanguageTool, a natural language style checker * Copyright (C) 2005-2015 Daniel Naber (http://www.danielnaber.de) * * This library is free software; you can redistribute it and/or * modify it under the terms of the GNU Lesser General Public * License as published by the Free Software Foundation; either * version 2.1 of the License, or (at your option) any later version. * * This library is distributed in the hope that it will be useful, * but WITHOUT ANY WARRANTY; without even the implied warranty of * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU * Lesser General Public License for more details. * * You should have received a copy of the GNU Lesser General Public * License along with this library; if not, write to the Free Software * Foundation, Inc., 51 Franklin St, Fifth Floor, Boston, MA 02110-1301 * USA */ package org.languagetool.rules.pt; import org.apache.commons.lang.StringUtils; import org.languagetool.rules.AbstractSimpleReplaceRule; import org.languagetool.rules.Example; import org.languagetool.rules.ITSIssueType; import java.io.IOException; import java.util.List; import java.util.Locale; import java.util.Map; import java.util.ResourceBundle; /** * A rule that matches words or phrases which should not be used and suggests * correct ones instead. * * @author Marco A.G.Pinto */ public class PortugueseReplaceRule extends AbstractSimpleReplaceRule { public static final String PORTUGUESE_SIMPLE_REPLACE_RULE = PT_PT_SIMPLE_REPLACE; private static final MapString, ListString wrongWords = load(/pt/replace.txt); private static final Locale PT_PT_LOCALE = new Locale(pt-PT); @Override protected MapString, ListString getWrongWords() { return wrongWords; } public BritishReplaceRule(final ResourceBundle messages) throws IOException { super(messages); setLocQualityIssueType(ITSIssueType.LocaleViolation); addExamplePair(Example.wrong(Vou a markerNew York/marker para a semana), Example.fixed(Vou a markerNova Iorque/marker para a semana)); } @Override public final String getId() { return PORTUGUESE_SIMPLE_REPLACE_RULE; } @Override public String getDescription() { return Palavras estrangeiras facilmente confundidas em Português; } @Override public String getShort() { return Estrangeirismo; } @Override public String getMessage(String tokenStr, ListString replacements) { return tokenStr + é um estrangeirismo. Em Português é mais comum usar: + StringUtils.join(replacements, , ) + .; } @Override public boolean isCaseSensitive() { return false; } @Override public Locale getLocale() { return PT_PT_LOCALE; } } -- Don't Limit Your Business. Reach for the Cloud. GigeNET's Cloud Solutions provide you with the tools and support that you need to offload your IT needs and focus on growing your business. Configured For All Businesses. Start Your Cloud Today. https://www.gigenetcloud.com/___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: pt_PT - Added: replace.txt
Hi?! Has anyone received my e-mail with the file? I haven't received an answer so far :'( Thanks! Kind regards, Marco A.G.Pinto - On 13/07/2015 14:26, Marco A.G.Pinto wrote: Hello! I have edited the EnglishReplaceRule.java I have changed the text but kept the original filenames and such since I didn't know what to replace there. In case the attachment is lost, here is in Dropbox: https://dl.dropboxusercontent.com/u/30674540/PortugueseReplaceRule.java Daniel, can you or someone make it available to pt_PT and make the filename changes? Thanks! Kind regards, Marco A.G.Pinto -- On 11/07/2015 09:39, Daniel Naber wrote: On 2015-07-10 20:18, Marco A.G.Pinto wrote: Could someone activate it? After it is activated, I will remove them from grammar.xml . We'll need some strings for that rule: -Message -Description -Short description -Two examples, one incorrect and one correct Also see https://github.com/languagetool-org/languagetool/blob/master/languagetool-language-modules/pl/src/main/java/org/languagetool/rules/pl/SimpleReplaceRule.java for examples. Regards Daniel -- -- Don't Limit Your Business. Reach for the Cloud. GigeNET's Cloud Solutions provide you with the tools and support that you need to offload your IT needs and focus on growing your business. Configured For All Businesses. Start Your Cloud Today. https://www.gigenetcloud.com/___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Italics in Latin/Foreign words pt_PT
Hello! Sorry to bring the subject again but this feature is a lot missed in OpenOffice and LibreOffice. There is a need for suggesting the use of italic for these kind of words. Has something changed or can be changed to allow its use? Please see the screenshot of MS Word 365: https://dl.dropboxusercontent.com/u/30674540/MS_Office365_italics_20150710.png This is needed in AOO/LO and could be done by having a separate .txt file with the words. Thanks! Kind regards, Marco A.G.Pinto --- -- -- Don't Limit Your Business. Reach for the Cloud. GigeNET's Cloud Solutions provide you with the tools and support that you need to offload your IT needs and focus on growing your business. Configured For All Businesses. Start Your Cloud Today. https://www.gigenetcloud.com/___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Fwd: AOO 5.0 - Portuguese dictionary pre- and -post reform
Daniel, I have just asked in the developers channel on IRC: [15:20] marcoagpinto erAck: In which version of LO will the pt_PT 1990 pos-reform start working? I need to inform LanguageTool to make special workings for it [15:22] marcoagpinto I mean, to be able to select pre- and post- reform [15:22] @erAck marcoagpinto: 5.0.0 IF the system uses a recent language-subtag-registry with liblangtag, or on Windows where we bundle our own [15:23] marcoagpinto thanks, I am going to reply to Daniel Naber right away [15:23] @erAck marcoagpinto: note the tag is pt-PT-ao1990 So, in 5.0.0 it will be available. Thanks! Kind regards from your friend, Marco A.G.Pinto -- On 10/07/2015 15:11, Daniel Naber wrote: On 2015-07-10 15:13, Marco A.G.Pinto wrote: After it is implemented, does that mean that finally we will be able to use the compounds pre- and post- pt_PT language agreement like I suggested a year or two ago? My understanding is that if you can select a language and its variant in the character/font dialog of LO, we can detect it from LT and activate the rule set that fits. So far this was not the case for pre/post-reform Portuguese and in LO 4.4 I cannot see it yet either. Regards Daniel -- -- Don't Limit Your Business. Reach for the Cloud. GigeNET's Cloud Solutions provide you with the tools and support that you need to offload your IT needs and focus on growing your business. Configured For All Businesses. Start Your Cloud Today. https://www.gigenetcloud.com/___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
pt_PT - Added: replace.txt
Hello! I have created the file: replace.txt I had rules in grammar.xml for English words which should be written in Portuguese: 1) device dispositivo 2) devices dispositivos 3) New York Nova Iorque Now I added them to this file. Could someone activate it? After it is activated, I will remove them from grammar.xml . Thanks! Kind regards, Marco A.G.Pinto --- -- -- Don't Limit Your Business. Reach for the Cloud. GigeNET's Cloud Solutions provide you with the tools and support that you need to offload your IT needs and focus on growing your business. Configured For All Businesses. Start Your Cloud Today. https://www.gigenetcloud.com/___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
English Dictionaries updated - 1-JUL-2015
Hello! I have released a new version of the English dictionaries for OpenOffice: - en_GB (British) (updated: 2015-07-01)* * en_GB has 634 new words. You canwait a few days until AOO triggers it or download it from the Extension site: http://extensions.openoffice.org/en/project/english-dictionaries-apache-openoffice Also, the Mozilla en_GB: https://addons.mozilla.org/en-US/firefox/addon/british-english-dictionary-2 And the Proofing Tool GUI project site: http://marcoagpinto.cidadevirtual.pt/proofingtoolgui.html (please notice the new link at the top: Get Involved) PS-I know it is still not officially the 1st of July but I have the update ready/packed for a couple of days or so. Thanks! Kind regards from your friend, Marco A.G.Pinto -- -- -- Don't Limit Your Business. Reach for the Cloud. GigeNET's Cloud Solutions provide you with the tools and support that you need to offload your IT needs and focus on growing your business. Configured For All Businesses. Start Your Cloud Today. https://www.gigenetcloud.com/___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: reminder: feature freeze for 3.0
Done! On 24/06/2015 14:32, Daniel Naber wrote: On 2015-06-24 15:16, Marco A.G.Pinto wrote: In Transifex, what is: ngram data directory: (ngram) See http://wiki.languagetool.org/finding-errors-using-big-data and https://en.wikipedia.org/wiki/N-gram Regards Daniel -- -- Monitor 25 network devices or servers for free with OpManager! OpManager is web-based network management software that monitors network devices and physical virtual servers, alerts via email sms for fault. Monitor 25 devices for free with no restriction. Download now http://ad.doubleclick.net/ddm/clk/292181274;119417398;o___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: reminder: feature freeze for 3.0
Daniel, In Transifex, what is: ngram data directory: (ngram) Thanks! Kind regards, Marco A.G.Pinto -- On 24/06/2015 10:11, Daniel Naber wrote: On 2015-06-22 21:05, Daniel Naber wrote: Finally, please help to keep the translations up-to-date: https://www.transifex.com/projects/p/languagetool/ It took some time, but transifex has now been updated and for most languages there are two strings that need translation. Regards Daniel -- Monitor 25 network devices or servers for free with OpManager! OpManager is web-based network management software that monitors network devices and physical virtual servers, alerts via email sms for fault. Monitor 25 devices for free with no restriction. Download now http://ad.doubleclick.net/ddm/clk/292181274;119417398;o ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel -- -- Monitor 25 network devices or servers for free with OpManager! OpManager is web-based network management software that monitors network devices and physical virtual servers, alerts via email sms for fault. Monitor 25 devices for free with no restriction. Download now http://ad.doubleclick.net/ddm/clk/292181274;119417398;o___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Proofing Tool GUI 3.0 - beta - build 76 - WIP
Hello! If you guys recall, I am improving en_GB with my own Hunspell tool. I have decided to dedicate some time in my vacations to add support for the three flags: 1) chr 2) num 3) long So far I have added the part that grabs the .AFF and optimises it (removes all spaces and creates an array with the position of each code). With numbers it takes 18 seconds on my T4300 laptop with the *gl_ES.aff* after I optimised the code to the limit... before optimising it, it would take near one minute. Since numbers have a variable size, that is what makes it slower, since I have to check when the number ends (last digit). I found dictionaries in my hdd with *2) num* and *3) long* which I had access to some time ago thanks to the LanguageTool team. I don't understand how recursivity works: *gl_ES.aff* SFX 520 er éren/666,134,135 er . is:infinitivo P6 + enclítico *nl.aff* SFX Yb 0 jes-/CaCbCp [^m] ts:NN2r These two examples shown above are from Mozilla dictionaries. Could one of you send me a small dictionary that I could test with Proofing Tool GUI and explain how I should make it work? Thanks! Kind regards, Marco A.G.Pinto --- -- -- ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
English Dictionaries updated - 1-JUN-2015
Hello! I have released a new version of the English dictionaries for OpenOffice: - en_CA (Canadian) (updated: 2015-05-18) - en_GB (British) (updated: 2015-06-01)* - en_US (American) (updated: 2015-05-18) * en_GB has 564 new words. You canwait a few days until AOO triggers it or download it from the Extension site: http://extensions.openoffice.org/en/project/english-dictionaries-apache-openoffice Also, the Mozilla en_GB: https://addons.mozilla.org/en-US/firefox/addon/british-english-dictionary-2 And the Proofing Tool GUI project site: http://marcoagpinto.cidadevirtual.pt/proofingtoolgui.html In the second half of June I will try to enhance Proofing Tool GUI (the Hunspell tool I use for editing the dictionaries) in order for it to accept also: 1) FLAG long 2) FLAG num At first I thought it would be very hard but, during sleep, I came up with an easy idea (just waiting for my vacation). Still, it won't support recursivity since it is very complex... but I am always coming up with new ideas... and maybe I will figure out something then. Also, I only added 564 new words this time because my PhD supervisor has been over me saying More studying and less hobbies :-( Thanks! Kind regards from your friend, Marco A.G.Pinto -- -- -- ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Last night's LanguageTool nightly diff test
Hello! I haven't received the last night e-mail. Could someone forward it to me so that I may know how pt_PT worked with my fixes? Thanks! Kind regards, Marco A.G.Pinto -- -- -- One dashboard for servers and applications across Physical-Virtual-Cloud Widest out-of-the-box monitoring support with 50+ applications Performance metrics, stats and reports that give you Actionable Insights Deep dive visibility with transaction tracing using APM Insight. http://ad.doubleclick.net/ddm/clk/290420510;117567292;y___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Morphologic issue - False positives
Hello! I have received the hits of LanguageTool and the two masculine/feminine rules I have created (um uma AND uma um) generate lots of false positives: http://languagetool.org/regression-tests/20150504/result_pt_20150504.html The sentence: A sua ideia de luz quântica foi um corte com a Física clássica. um corte suggests uma. I checked the morphologic structure in LanguageTool's site and corte appears as: NCFS000 NCMS000 It is both masculine and feminine. The XML code I have is: !-- Concordance gender - UM UMA -- rule id=UM_UMA name=Erro de concordância UM-UMA pattern marker tokenum/token /marker token postag=NCFS000/token /pattern messageErro de concordância um/uma./message suggestionuma/suggestion example correction=umaQuero markerum/marker foto dela./example /rule !-- Concordance gender - UMA UM -- rule id=UMA_UM name=Erro de concordância UMA-UM pattern marker tokenuma/token /marker token postag=NCMS000/token /pattern messageErro de concordância uma/um./message suggestionum/suggestion example correction=umQuero markeruma/marker carro novo./example /rule There are other examples: uma importante in which importante appears as: AQ0CS0 NCMS000 How shall I do it? Thanks! Kind regards, Marco A.G.Pinto -- -- -- One dashboard for servers and applications across Physical-Virtual-Cloud Widest out-of-the-box monitoring support with 50+ applications Performance metrics, stats and reports that give you Actionable Insights Deep dive visibility with transaction tracing using APM Insight. http://ad.doubleclick.net/ddm/clk/290420510;117567292;y___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Create a pt_PT rule: HÁ n SEGUNDOS/MINUTOS/HORAS/DIAS/MESES/ANOS ATRÁS (remove ATRÁS)
Thanks, Yakov! It worked like a charm! Not even commercial software have this rule! :-P Kind regards from your friend, Marco A.G.Pinto --- On 30/04/2015 21:03, Yakov Reztsov wrote: Hello! You rule is like this: !-- HÁ n SEGUNDOS/MINUTOS/HORAS/DIAS/MESES/ANOS ATRÁS (remove ATRÁS) -- rule id=HÁ-ATRÁS name=há n tempo atrás pattern tokenhá/token token/token token regexp=yessegundos?|minutos?|horas?|dias?|mês|meses|anos?/token tokenatrás/token /pattern messageO verbo haver remove a necessidade de usar atrás: suggestion\1 \2 \3/suggestion/message example correction=HÁ n SEGUNDOSmarkerHÁ n SEGUNDOS ATRÁS/marker/example /rule I tested this rule on community.languagetool.org, and I got 21 matches. -- Yakov Reztsov Четверг, 30 апреля 2015, 18:55 +01:00 от Marco A.G.Pinto marcoagpi...@mail.telepac.pt: Hello! How do I create a rule that asks to remove the word atrás when há is used since it is redundant? The rule in English for example would be: 10 days ago - há 10 dias atrás The verb há in Portuguese removes the need to use atrás. I triggered here the plural and singular forms of time. I just need to know how to make the rule suggest the removal of atrás. !-- HÁ n SEGUNDOS/MINUTOS/HORAS/DIAS/MESES/ANOS ATRÁS (remove ATRÁS) -- rule id=HÁ-ATRÁS name=há n tempo atrás pattern tokenhá/token token/token ; HOW TO MAKE IT ACCEPT ANY WORD HERE? token regexp=yessegundos?|minutos?|horas?|dias?|mês|meses|anos?/token tokenatrás/token /pattern messageO verbo haver remove a necessidade de usar atrás./message example correction=/example /rule Thanks! Kind regards, Marco A.G.Pinto -- -- -- One dashboard for servers and applications across Physical-Virtual-Cloud Widest out-of-the-box monitoring support with 50+ applications Performance metrics, stats and reports that give you Actionable Insights Deep dive visibility with transaction tracing using APM Insight. http://ad.doubleclick.net/ddm/clk/290420510;117567292;y___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Fwd: LanguageTool nightly diff test
Hello Daniel, Last night, after the diff test full of false positives for pt_PT, I released a new grammar.xml to fix it. However, tonight I received another diff that still shows the errors I have fixed. But, I just installed the nightly build and checked the stand-alone and OpenOffice and in them, it works okay. Maybe only tomorrow night the diff will be correct? The problem was that it was also triggering the word e instead of a: *-Line 1, column 86, Rule ID: A_AS2[1]** **-Message: Erro de concordância do plural.** **-Suggestion: as** **-...babilônios, gregos, chineses, indianos, iranianos **e**maias realizaram observações metódicas do céu not...** **- ^ ** **-Line 1, column 86, Rule ID: A_AS4[1]** **-Message: Erro de concordância do plural.** **-Suggestion: as** **-...babilônios, gregos, chineses, indianos, iranianos **e**maias realizaram observações metódicas do céu not...** **- ^ ** **-** **-Title: Astronomia* Thanks! Kind regards, Marco A.G.Pinto -- Forwarded Message Subject:LanguageTool nightly diff test Date: Fri, 24 Apr 2015 23:09:45 +0200 (CEST) From: dna...@users.sourceforge.net Reply-To: languagetool-devel@lists.sourceforge.net To: languagetool-comm...@lists.sourceforge.net LanguageTool Nightly Diff Overview 2015-04-24 22:20 This page lists the results of our automatic nightly testing against a fixed Wikipedia and Tatoeba corpus with 4 sentences per language. Changes 2015-04-23 22:20 to 2015-04-24 22:20 Version: 3.0-SNAPSHOT (2015-04-24 22:01) Not changed: en Not changed: de Not changed: fr Not changed: ru Not changed: br [1]Changed: ca [2]Changed: pl Not changed: it [3]Changed: pt Not changed: es Not changed: nl Not changed: eo Total runtime: 2015-04-24 22:20 to 2015-04-24 23:09 References 1. http://languagetool.org/regression-tests/20150424/result_ca_20150424.html 2. http://languagetool.org/regression-tests/20150424/result_pl_20150424.html 3. http://languagetool.org/regression-tests/20150424/result_pt_20150424.html -- -- One dashboard for servers and applications across Physical-Virtual-Cloud Widest out-of-the-box monitoring support with 50+ applications Performance metrics, stats and reports that give you Actionable Insights Deep dive visibility with transaction tracing using APM Insight. http://ad.doubleclick.net/ddm/clk/290420510;117567292;y___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
pt_PT: Added postag rules (plural/singular)
Hello! I have added some basic posttag concordance rules (plural/singular): as a a as os o o os I had to create four rules for each. To have just one rule for each, I would need to know how to use: regexp=yes with postags. Can someone tell me how to do it? Thanks! Kind regards, Marco A.G.Pinto -- -- -- BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT Develop your own process in accordance with the BPMN 2 standard Learn Process modeling best practices with Bonita BPM through live exercises http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- event?utm_ source=Sourceforge_BPM_Camp_5_6_15utm_medium=emailutm_campaign=VA_SF___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Concordance error pt_PT
Hello! I am at the university right now and I brought my netbook, but I can't do much here. Can you enable the synthesizer for Portuguese? When I arrive home at night I will see what I can do in my main computer. Thanks! Kind regards, Marco A.G.Pinto --- Às 15:01 de 21-04-2015, Jaume Ortolà i Font escreveu: Marco, You can write: rule id=AS_A-OS_O name=Erro de concordância do plural pattern marker token postag='DA0FP0'/token /marker token postag='NCFS000'/token /pattern messageErro de concordância do plural./message suggestiona/suggestion example correction='A'markerAs/marker vaca está no pasto./example exampleA vaca está no pasto./example /rule But if you want to go ahead and write properly more rules like this, you will need to use synthensis and disambiguation. I can enable the synthesizer for Portuguese (the synthesis dictionary is already there), and I can also point out some strategies for disambiguation, that will be similar to the ones used in Catalan, I guess. But it can become quite complicated. Regards, Jaume 2015-04-21 14:26 GMT+02:00 Marco A.G.Pinto marcoagpi...@mail.telepac.pt mailto:marcoagpi...@mail.telepac.pt: Thanks, Jaume! It works without suggestions. What if I want to suggest a to replace as? I tried: !-- Concordance error plural - BRAZILIAN USER-- rule id=AS_A-OS_O name=Erro de concordância do plural pattern token postag='DA0FP0'/token token postag='NCFS000'/token /pattern messageErro de concordância do plural./message *example correction='a'markerAs/marker vaca está no pasto./example* exampleA vaca está no pasto./example /rule But it gives errors in TESTRULES PT. I have to leave to university soon... At night, if you reply, I will try again after I am back home. Thanks! PS-We are in the good path! Kind regards, Marco A.G.Pinto -- On 21/04/2015 13:13, Jaume Ortolà i Font wrote: Hi, You have a problem in the example correction. The rule should look like this: rule id=AS_A-OS_O name=Erro de concordância do plural pattern token postag='DA0FP0'/token token postag='NCFS000'/token /pattern messageErro de concordância do plural./message example correction=''markerAs vaca/marker está no pasto./example exampleA vaca está no pasto./example /rule If there is no suggestion, the correction field has to be empty. Regards, Jaume Ortolà 2015-04-21 13:57 GMT+02:00 Marco A.G.Pinto marcoagpi...@mail.telepac.pt mailto:marcoagpi...@mail.telepac.pt: Hello! I have tried to use an example posted in the forum by a Brazilian user: *!-- Concordance error plural - BRAZILIAN USER--** **rule id=AS_A-OS_O name=Erro de concordância do plural** ** pattern** ** token postag='DA0FP0'/token** ** token postag='NCFS000'/token** ** /pattern** ** messageErro de concordância do plural./message** ** example correction=TESTEmarkerAs vaca/marker está no pasto./example** ** exampleA vaca está no pasto./example** **/rule ** * For example: using plural/singular incorrectly. But, this gives errors when I use TESTRULES PT in the command line. How can I fix it? Thanks! Kind regards, Marco A.G.Pinto -- -- -- BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT Develop your own process in accordance with the BPMN 2 standard Learn Process modeling best practices with Bonita BPM through live exercises http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- event?utm_ source=Sourceforge_BPM_Camp_5_6_15utm_medium=emailutm_campaign=VA_SF ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net mailto:Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel -- BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT Develop your own process in accordance with the BPMN 2 standard Learn Process modeling best practices with Bonita BPM through live exercises http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- event?utm_ source=Sourceforge_BPM_Camp_5_6_15utm_medium=emailutm_campaign=VA_SF ___ Languagetool-devel mailing list Languagetool-devel
Concordance error pt_PT
Hello! I have tried to use an example posted in the forum by a Brazilian user: *!-- Concordance error plural - BRAZILIAN USER--** **rule id=AS_A-OS_O name=Erro de concordância do plural** ** pattern** ** token postag='DA0FP0'/token** ** token postag='NCFS000'/token** ** /pattern** ** messageErro de concordância do plural./message** ** example correction=TESTEmarkerAs vaca/marker está no pasto./example** ** exampleA vaca está no pasto./example** **/rule ** * For example: using plural/singular incorrectly. But, this gives errors when I use TESTRULES PT in the command line. How can I fix it? Thanks! Kind regards, Marco A.G.Pinto -- -- -- BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT Develop your own process in accordance with the BPMN 2 standard Learn Process modeling best practices with Bonita BPM through live exercises http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- event?utm_ source=Sourceforge_BPM_Camp_5_6_15utm_medium=emailutm_campaign=VA_SF___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Concordance error - pt_PT
Hello Alberto, According to our conversations on Skype last year, you had created a shared folder: - You had placed this example in the folder: https://raw.githubusercontent.com/languagetool-org/languagetool/master/languagetool-language-modules/gl/src/main/resources/org/languagetool/resource/gl/tagset.txt - You told me that you were going to try to follow the instructions on: http://wiki.languagetool.org/developing-a-tagger-dictionary - you told me that you had to revise the tagset defined by José - you told me to try: java -cp languagetool.jar org.languagetool.dev.POSDictionaryBuilder pt-preao.dump org/languagetool/resource/pt/portuguese.info - We had the portuguese.dict in the shared folder This is what I remember. Thanks! Kind regards, Marco A.G.Pinto -- On 15/04/2015 11:19, Alberto Simões wrote: I though I got you a dictionary for that some time ago :-/ Can you dig our old conversation in the emails archive and see where we got stalled? I am very interested in having concordance checking too. Best, Alberto On 14/04/15 22:06, Marco A.G.Pinto wrote: Hello! Alberto and José, is there any news regarding the tagger dictionary? Thanks! Kind regards, Marco A.G.Pinto --- On 14/04/2015 21:40, Jaume Ortolà i Font wrote: Hi Marco, You need a tagger dictionary if you want to find concordance errors. We talked some time ago about adding a tagger dictionary for Portuguese. Is there any news on this? Regards, Jaume 2015-04-14 15:20 GMT+02:00 Marco A.G.Pinto marcoagpi...@mail.telepac.pt mailto:marcoagpi...@mail.telepac.pt: Hello! Could someone explain how to add concordance rules for Portuguese? What I need is to check if a person uses masculine when it was feminine (o-a | a-o) and also singular when it was plural (o-os | a-as) (and vice versa of course). This feature has been missing for a long time in Portuguese and I hoped I could get it to work on LT 3.0. Thanks! Kind regards, Marco A.G.Pinto - -- -- BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT Develop your own process in accordance with the BPMN 2 standard Learn Process modeling best practices with Bonita BPM through live exercises http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- event?utm_ source=Sourceforge_BPM_Camp_5_6_15utm_medium=emailutm_campaign=VA_SF___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Concordance error - pt_PT
But, How can I add the rules as I don't have a clue? Thanks! Marco A.G.Pinto -- On 15/04/2015 14:37, Jaume Ortolà i Font wrote: This dictionary is the FreeLing Potuguese dictionary. It was added by me. Alberto suggested using another one provided by him. Regards, Jaume 2015-04-15 14:47 GMT+02:00 Marco A.G.Pinto marcoagpi...@mail.telepac.pt mailto:marcoagpi...@mail.telepac.pt: Daniel, It returns the following (I mixed masculine with feminine words): Kind regards, Marco A.G.Pinto -- On 15/04/2015 13:22, Daniel Naber wrote: On 2015-04-15 14:04, Marco A.G.Pinto wrote: - You had placed this example in the folder: https://raw.githubusercontent.com/languagetool-org/languagetool/master/languagetool-language-modules/gl/src/main/resources/org/languagetool/resource/gl/tagset.txt A tagger for pt has been added 2014-07-08. You can see what it returns by pasting some Portuguese text here: http://community.languagetool.org/analysis/index?lang=pt Regards Daniel -- -- BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT Develop your own process in accordance with the BPMN 2 standard Learn Process modeling best practices with Bonita BPM through live exercises http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- event?utm_ source=Sourceforge_BPM_Camp_5_6_15utm_medium=emailutm_campaign=VA_SF___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Concordance error - pt_PT
Daniel, It returns the following (I mixed masculine with feminine words): Kind regards, Marco A.G.Pinto -- On 15/04/2015 13:22, Daniel Naber wrote: On 2015-04-15 14:04, Marco A.G.Pinto wrote: - You had placed this example in the folder: https://raw.githubusercontent.com/languagetool-org/languagetool/master/languagetool-language-modules/gl/src/main/resources/org/languagetool/resource/gl/tagset.txt A tagger for pt has been added 2014-07-08. You can see what it returns by pasting some Portuguese text here: http://community.languagetool.org/analysis/index?lang=pt Regards Daniel -- -- BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT Develop your own process in accordance with the BPMN 2 standard Learn Process modeling best practices with Bonita BPM through live exercises http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- event?utm_ source=Sourceforge_BPM_Camp_5_6_15utm_medium=emailutm_campaign=VA_SF___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Concordance error - pt_PT
Hello! Could someone explain how to add concordance rules for Portuguese? What I need is to check if a person uses masculine when it was feminine (o-a | a-o) and also singular when it was plural (o-os | a-as) (and vice versa of course). This feature has been missing for a long time in Portuguese and I hoped I could get it to work on LT 3.0. Thanks! Kind regards, Marco A.G.Pinto -- -- -- BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT Develop your own process in accordance with the BPMN 2 standard Learn Process modeling best practices with Bonita BPM through live exercises http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- event?utm_ source=Sourceforge_BPM_Camp_5_6_15utm_medium=emailutm_campaign=VA_SF___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Concordance error - pt_PT
Hello! Alberto and José, is there any news regarding the tagger dictionary? Thanks! Kind regards, Marco A.G.Pinto --- On 14/04/2015 21:40, Jaume Ortolà i Font wrote: Hi Marco, You need a tagger dictionary if you want to find concordance errors. We talked some time ago about adding a tagger dictionary for Portuguese. Is there any news on this? Regards, Jaume 2015-04-14 15:20 GMT+02:00 Marco A.G.Pinto marcoagpi...@mail.telepac.pt mailto:marcoagpi...@mail.telepac.pt: Hello! Could someone explain how to add concordance rules for Portuguese? What I need is to check if a person uses masculine when it was feminine (o-a | a-o) and also singular when it was plural (o-os | a-as) (and vice versa of course). This feature has been missing for a long time in Portuguese and I hoped I could get it to work on LT 3.0. Thanks! Kind regards, Marco A.G.Pinto -- -- -- BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT Develop your own process in accordance with the BPMN 2 standard Learn Process modeling best practices with Bonita BPM through live exercises http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- event?utm_ source=Sourceforge_BPM_Camp_5_6_15utm_medium=emailutm_campaign=VA_SF ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net mailto:Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel -- BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT Develop your own process in accordance with the BPMN 2 standard Learn Process modeling best practices with Bonita BPM through live exercises http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- event?utm_ source=Sourceforge_BPM_Camp_5_6_15utm_medium=emailutm_campaign=VA_SF ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel -- -- BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT Develop your own process in accordance with the BPMN 2 standard Learn Process modeling best practices with Bonita BPM through live exercises http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- event?utm_ source=Sourceforge_BPM_Camp_5_6_15utm_medium=emailutm_campaign=VA_SF___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
[SPAM] Re: Rule suggestion
Hello! I am improving the British speller (forked). If you find missing words, please open a case in my GitHub: https://github.com/marcoagpinto/aoo-mozilla-en-dict I will try to add all the words provided they exist in official dictionaries/sources. This means I will investigate each one. Thanks! Kind regards, Marco A.G.Pinto -- On 31/03/2015 22:19, Torsten Wagner wrote: Hi, as I read on the website you are looking for suggestions of new rules, even if they seem to be trivial. I found that the languagetool return a lot of false positive for abbreviations ending with an s. Languagetool assumes that you might use the plural from of the word wrongly. Message: Don't use indefinite articles with plural words. Did you mean a lap or simply LAPS? (deactivate) Correction: a lap; LAPS Context: ...range and the handling of a LAPS required a good amount of experience an... The S here stands for sensor and is not a plural s Other examples of abbreviations which just jump out of my mind are ADHS (german form of Attention deficit hyperactivity disorder ) PS postscript FPS frames per second LTS long term support DDS direct digital synthesis Most of the time those abbreviations are written with all capital letters. Maybe the above rule can be enhanced to exclude words written in all capital letters. Thanks for the great software Torsten -- -- Dive into the World of Parallel Programming The Go Parallel Website, sponsored by Intel and developed in partnership with Slashdot Media, is your hub for all things parallel software development, from weekly thought leadership blogs to news, videos, case studies, tutorials and more. Take a look and join the conversation now. http://goparallel.sourceforge.net/___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
English Dictionaries updated - 1-APR-2015
Hello! Just to inform everyone that I have updated the dictionaries for OpenOffice and Mozilla. Only the British one has been changed and it now has 738 new words since the previous version. In this version I focused a lot in the names of places and persons, since it is one of the biggest weaknesses in the dictionaries. The AOO may take a few days to trigger, but you can download it manually from its official site. Direct sites where it can be found: - AOO http://extensions.openoffice.org/en/project/english-dictionaries-apache-openoffice - Mozilla (Firefox/Thunderbird/SeaMonkey) https://addons.mozilla.org/en-US/firefox/addon/british-english-dictionary-2 - Official project site (Proofing Tool GUI) http://marcoagpinto.cidadevirtual.pt/proofingtoolgui.html - Official project site (GitHub) https://github.com/marcoagpinto/aoo-mozilla-en-dict Please notice that I added a paragraph with some extra text at the bottom of GitHub to explain to people how to check if a word is already there. Could someone tell me if the text is okay or if should add/remove something? Thanks! Kind regards from your friend, Marco A.G.Pinto -- -- -- Dive into the World of Parallel Programming The Go Parallel Website, sponsored by Intel and developed in partnership with Slashdot Media, is your hub for all things parallel software development, from weekly thought leadership blogs to news, videos, case studies, tutorials and more. Take a look and join the conversation now. http://goparallel.sourceforge.net/___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Comprobar o texto
Fixed! Thanks! Marco A.G.Pinto -- On 19/03/2015 17:30, Daniel Naber wrote: On 2015-03-19 17:01, Marco A.G.Pinto wrote: The text in the subject appears in Spanish: The correct text should be: Verificar o texto. How do I fix it? Or can someone fix it for me before the official release? It's in this file: https://github.com/languagetool-org/languagetool/blob/master/languagetool-office-extension/src/main/resources/Addons.xcu Regards Daniel -- -- Dive into the World of Parallel Programming The Go Parallel Website, sponsored by Intel and developed in partnership with Slashdot Media, is your hub for all things parallel software development, from weekly thought leadership blogs to news, videos, case studies, tutorials and more. Take a look and join the conversation now. http://goparallel.sourceforge.net/___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Comprobar o texto
Hello! The text in the subject appears in Spanish: The correct text should be: Verificar o texto. How do I fix it? Or can someone fix it for me before the official release? Thanks! Kind regards, Marco A.G.Pinto -- -- -- Dive into the World of Parallel Programming The Go Parallel Website, sponsored by Intel and developed in partnership with Slashdot Media, is your hub for all things parallel software development, from weekly thought leadership blogs to news, videos, case studies, tutorials and more. Take a look and join the conversation now. http://goparallel.sourceforge.net/___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Proofing Tool GUI V3.0 - beta - build 72 - 2015-03-05
Hello everyone, Tonight I have updated PTG to build 72. As you probably know, this is an open-source Hunspell tool which I am using to work on the British dictionary. The last build I released was 70 and, since then, I have been able to optimise a lot the processing of wordlists. en_GB now takes almost half the time to process (extract, count, etc). This will also be visible on other dictionaries. I have also added the feature of exporting wordlists with just the hyphens, and that is why I decided to release this build today: Proofing Tool GUI 3.0 - beta - build 72 is available from its official site: http://marcoagpinto.cidadevirtual.pt/proofingtoolgui.html Please notice that it only works with one code chr in the .AFFs (some more recent .AFFs may use other forms of encoding such as numbers and letters several characters long). I tried to improve this, but it is very hard to accomplish. Even more than that form of encoding also uses recursivity in the rules which is a mess without proper documentation. Thanks! Kind regards, Marco A.G.Pinto -- -- -- Dive into the World of Parallel Programming The Go Parallel Website, sponsored by Intel and developed in partnership with Slashdot Media, is your hub for all things parallel software development, from weekly thought leadership blogs to news, videos, case studies, tutorials and more. Take a look and join the conversation now. http://goparallel.sourceforge.net/___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: English dictionary status?
Daniel, Your idea is great :-P But, maybe the levels are generated automatically by Kevin using software? And the levels, they have words that are American and others that are written differently in Oxford British. I know my GitHub seems redundant, but I wanted to add there all English Dictionaries, but the other guys vanished long ago... for example: New Zealand, South Africa, Australia... if I could contact those guys I would add there their dictionaries... but they are long gone and I don't know how to reach them. :'( Kind regards from your friend, Marco A.G.Pinto -- On 02/03/2015 13:32, Daniel Naber wrote: On 2015-02-27 16:23, Marco A.G.Pinto wrote: The thing is that SCOWL thing... and the use of common words or whatever... I am not adding the most used words, but the ones that are in the dictionaries. Wouldn't it be possible to do this in Kevin's infrastructure? He already has several levels of words, couldn't your words be added as another level? Currently everything seems to be duplicated: the word lists, the releases, the github projects... Regards Daniel -- -- Dive into the World of Parallel Programming The Go Parallel Website, sponsored by Intel and developed in partnership with Slashdot Media, is your hub for all things parallel software development, from weekly thought leadership blogs to news, videos, case studies, tutorials and more. Take a look and join the conversation now. http://goparallel.sourceforge.net/___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: bounty for Firefox grammar checker interface
Why not also Thunderbird? Kind regards, Marco A.G.Pinto -- On 28/02/2015 16:08, Daniel Naber wrote: Hi, I have posted a $150 bounty to bountysource.com for integrating a grammar checker interface to Firefox. Why? Because the way LanguageToolFx - our Firefox extension - works is nice, but it would be much nicer to have a real integration where words get underlined right in the text area. At bountysource.com you can join and add your bounty to this. The bounties add up and, in the best case, someone fixes this issue. For the case it doesn't happen, you can select whether your bounty will be used as a donation or whether you get it back: https://www.bountysource.com/issues/9049134-mozilla-should-has-a-grammar-checker Please join and add a bounty. The minimum is $5 and you need Bitcoin or a PayPal account. Regards Daniel -- -- Dive into the World of Parallel Programming The Go Parallel Website, sponsored by Intel and developed in partnership with Slashdot Media, is your hub for all things parallel software development, from weekly thought leadership blogs to news, videos, case studies, tutorials and more. Take a look and join the conversation now. http://goparallel.sourceforge.net/___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: English dictionary status?
Hello! I have been releasing monthly updates and sharing all the missing words I add to GB in Kevin's GitHub. What happens is that I am adding too many words in short periods of time and I swamp Kevin's GitHub with many hundreds of missing words. From the words I share, Kevin only uses some. Since I have an Oxford Gold Account (which I bought just for the task of improving GB - yes, I am silly, you may think), I have access to the whole Oxford examples. What now happens is that, even though I have 40.txt, 50.txt and 60.txt from Kevin, I only add the words from there after checking them myself one at a time. I have found that some are American and others are written differently in Oxford's dictionary. That is why in this month's update (tonight I will release 1-MAR-2015 since it is ready for a day or two) I have 735 new words, almost none of them from Kevin's txts (this happened because of the Gold Account which gave me access to a huge source of words). This is what I wrote on Mozilla's ML a few days ago (please notice that tonight's version has 147 665 words - V2.22): There are three versions on Mozilla's extensions site: *1) Mark Tyndall's* 136 404 words *2) Lucas's **(updated)* 136 404 words - just the FF, TB and SM versions have been changed *3) Marco A.G.Pinto's (forked) - 2.21* 146 930 words - 10 000 new words https://addons.mozilla.org/en-US/firefox/addon/british-english-dictionary-2 This is also the official version that ships with Apache OpenOffice: http://extensions.openoffice.org/en/project/english-dictionaries-apache-openoffice Every month, around 600 new words are added. I didn't change the licence, so it is the same as Mark's and Lucas' I have been trying my GB to become the default in Mozilla and LibreOffice (built-in) but I don't know the people I should annoy... maybe Kevin and László? Also, before packing the new GB I added: 5105) Fri (abbreviation: Friday) 5106) Jun (abbreviation: June) 5107) Jul (abbreviation: July) 5108) Sep (abbreviation: September) Yes, I typed all weekdays and months by hand to make sure they were all there, and these four were missing. Like I mentioned before, I want in a year from now, for people to write Masters/PhD thesis using my GB and LanguageTool. Thanks for your time! Kind regards from your friend, Marco A.G.Pinto - On 27/02/2015 13:06, Daniel Naber wrote: Hi, this questions goes mostly to Marco, but it may be interesting for others, too: I see you're regularly updating your English dictionary, which is great. Is there progress in getting your dictionary and that of Kevin Atkinson closer together? Is there a metric for that, like the size of the diff of both dictionaries (after unmuch and sort)? Regards Daniel -- -- Dive into the World of Parallel Programming The Go Parallel Website, sponsored by Intel and developed in partnership with Slashdot Media, is your hub for all things parallel software development, from weekly thought leadership blogs to news, videos, case studies, tutorials and more. Take a look and join the conversation now. http://goparallel.sourceforge.net/___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Improving translation
On 22/01/2015 18:33, Daniel Naber wrote: On 2015-01-22 01:24, Marco A.G.Pinto wrote: I wanted to replace on OpenOffice (compound words): Juntos grafia de palavras by Esta palavra é hifenizada But, I can't find it in Transifex. I can't find it in our source code either, where exactly does it appear? Regards Daniel Daniel, OpenOffice underlines the compound words without hyphen and, when we right-click on them, it says: Juntos grafias de palavras Thanks! Kind regards, Marco A.G.Pinto -- -- -- New Year. New Location. New Benefits. New Data Center in Ashburn, VA. GigeNET is offering a free month of service with a new server in Ashburn. Choose from 2 high performing configs, both with 100TB of bandwidth. Higher redundancy.Lower latency.Increased capacity.Completely compliant. http://p.sf.net/sfu/gigenet___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Improving translation
I believe I have fixed it. Daniel, can you or someone confirm? Thanks! Kind regards, Marco A.G.Pinto -- On 22/01/2015 19:59, Daniel Naber wrote: On 2015-01-22 19:55, Marco A.G.Pinto wrote: OpenOffice underlines the compound words without hyphen and, when we right-click on them, it says: Juntos grafias de palavras It's in PreReformPortugueseCompoundRule.java and PostReformPortugueseCompoundRule.java Regards Daniel -- -- New Year. New Location. New Benefits. New Data Center in Ashburn, VA. GigeNET is offering a free month of service with a new server in Ashburn. Choose from 2 high performing configs, both with 100TB of bandwidth. Higher redundancy.Lower latency.Increased capacity.Completely compliant. http://p.sf.net/sfu/gigenet___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Improving translation
I reverted it, since it shows three possible tips. The problem is that on the stand-alone tool it shows the correct and if we click in more it shows Juntos blah blah and the correct tip. There must be other way of fixing it in OpenOffice. Thanks! Kind regards, Marco A.G.Pinto -- On 22/01/2015 21:53, Marco A.G.Pinto wrote: I believe I have fixed it. Daniel, can you or someone confirm? Thanks! Kind regards, Marco A.G.Pinto -- On 22/01/2015 19:59, Daniel Naber wrote: On 2015-01-22 19:55, Marco A.G.Pinto wrote: OpenOffice underlines the compound words without hyphen and, when we right-click on them, it says: Juntos grafias de palavras It's in PreReformPortugueseCompoundRule.java and PostReformPortugueseCompoundRule.java Regards Daniel -- -- New Year. New Location. New Benefits. New Data Center in Ashburn, VA. GigeNET is offering a free month of service with a new server in Ashburn. Choose from 2 high performing configs, both with 100TB of bandwidth. Higher redundancy.Lower latency.Increased capacity.Completely compliant. http://p.sf.net/sfu/gigenet ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel -- -- New Year. New Location. New Benefits. New Data Center in Ashburn, VA. GigeNET is offering a free month of service with a new server in Ashburn. Choose from 2 high performing configs, both with 100TB of bandwidth. Higher redundancy.Lower latency.Increased capacity.Completely compliant. http://p.sf.net/sfu/gigenet___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Improving translation
Hello! I wanted to replace on OpenOffice (compound words): Juntos grafia de palavras by Esta palavra é hifenizada But, I can't find it in Transifex. Could someone help? Thanks! Kind regards, Marco A.G.Pinto - -- -- New Year. New Location. New Benefits. New Data Center in Ashburn, VA. GigeNET is offering a free month of service with a new server in Ashburn. Choose from 2 high performing configs, both with 100TB of bandwidth. Higher redundancy.Lower latency.Increased capacity.Completely compliant. http://p.sf.net/sfu/gigenet___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Rule for (blah blah) ;
Hello! I am revising the university course manuals. In one of them, there was the text: (blah blah blah) ;. If you notice, there is a space after the ) and before the ;. M$ Word suggested to remove the space, but on OpenOffice using LanguageTool, there was no suggestion. Could this be added to LT? Thanks! Kind regards, Marco A.G.Pinto -- -- -- New Year. New Location. New Benefits. New Data Center in Ashburn, VA. GigeNET is offering a free month of service with a new server in Ashburn. Choose from 2 high performing configs, both with 100TB of bandwidth. Higher redundancy.Lower latency.Increased capacity.Completely compliant. http://p.sf.net/sfu/gigenet___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
en_GB updated - 2015-01-01
Hello! I have updated en_GB with 814 new words. I have also updated Proofing Tool GUI which now looks better in Kubuntu and has several bug fixes and improvements. http://marcoagpinto.cidadevirtual.pt/proofingtoolgui.html In January I will try to add the numbers and characters decoding but it will be basic since it is very complex to use recursivity (this last word isn't in the dictionary... I have just noticed it... I will add it in February) Thanks! Kind regards, Marco A.G.Pinto --- -- -- Dive into the World of Parallel Programming! The Go Parallel Website, sponsored by Intel and developed in partnership with Slashdot Media, is your hub for all things parallel software development, from weekly thought leadership blogs to news, videos, case studies, tutorials and more. Take a look and join the conversation now. http://goparallel.sourceforge.net___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Help with unmunch and Icelandic + Galician
Hello Adrián, I didn't clone unmunch, I did my own version based on what I was told. What I came across is that when I began working on PTG in 2013, the documentation and help from mailing lists regarding Hunspell, didn't mention what I discovered recently. The examples and help I had access to didn't mention that prefixes were checked against all generated suffixes. On 05/12/2014 07:33, Adrián Chaves Fernández wrote: The repository where the unmunch script is located is part of a network of repositories. Your output suggest that you did not included submodules when you cloned the repository, see http://stackoverflow.com/questions/3796927/how-to-git-clone-including-submodules Any feedback about the script is appreciated. Kind regards, Marco A.G.Pinto --- -- -- Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server from Actuate! Instantly Supercharge Your Business Reports and Dashboards with Interactivity, Sharing, Native Excel Exports, App Integration more Get technology previously reserved for billion-dollar corporations, FREE http://pubads.g.doubleclick.net/gampad/clk?id=164703151iu=/4140/ostg.clktrk___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Hunspell unmunching question
Hello! Around a week ago, Peter from England sent me an e-mail suggesting new words to be added to en_GB. One of them was unsubscribe. Here is what appears in Proofing Tool GUI: The strange thing is that I tried the variants in Mozilla and OpenOffice and none of them was marked as a typo. I started meditating about it and wondered if in Hunspell the prefixes would attach themselves to all suffixes. Today I made a test, please see the archive: https://dl.dropboxusercontent.com/u/30674540/hunspell_issue_marcoagpinto_20141204.zip It contains the extracted wordlists both in PTG and Unmunch and also the .DIC + .AFF I created for the tests. In my PTG 3.0 build 67 I get: *subscribe** **resubscribe** **subscribing** **oversubscribe** **subscribes** **subscribed** **unsubscribe** **000** **subscribe** **unsubscribe** **resubscribe** **subscribing** **oversubscribe** **subscribes** **subscribed** ** *In Unmunch for Linux I got: *subscribe** **subscribing** **subscribed** **subscribes** **resubscribing** **oversubscribing** **unsubscribing** **resubscribed** **oversubscribed** **unsubscribed** **resubscribes** **oversubscribes** **unsubscribes** **resubscribe** **oversubscribe** **unsubscribe** **000** **subscribe** **subscribing** **subscribed** **subscribes** **resubscribing** **oversubscribing** **unsubscribing** **resubscribed** **oversubscribed** **unsubscribed** **resubscribes** **oversubscribes** **unsubscribes** **resubscribe** **oversubscribe** **unsubscribe** ** *I placed a 000 to divide the same word with an exchanged order of the code U to make sure it would produce the same results, no matter its position. What this means is that I probably need to change the code of my tool, maybe create three arrays: 1st - to store the words with suffixes 2nd - to store the codes of the prefixes 3rd - to store 1st plus all its combinations with the prefixes (it would apply prefixes to 1st and store them in 3rd ) Then, I would display the prefixes at the bottom in PTG not following the order of the codes? What this also means is that there are hundreds of combinations not appearing in the wordlist which I always publish in .txt in the GitHub of the project but that are processed by Hunspell in Mozilla (Firefox, Thunderbird and SeaMonkey) and Apache OpenOffice. Thanks for your time! Kind regards, Marco A.G.Pinto -- -- -- Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server from Actuate! Instantly Supercharge Your Business Reports and Dashboards with Interactivity, Sharing, Native Excel Exports, App Integration more Get technology previously reserved for billion-dollar corporations, FREE http://pubads.g.doubleclick.net/gampad/clk?id=164703151iu=/4140/ostg.clktrk___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Proofing Tool GUI - Icelandic + Galician
Hello, I didn't go to work today because I was not feeling well. I have decided to dedicate some time today to improve PTG in order for it also to unmunch .DICs with numbers instead of characters. Daniel and Ruud, could you explain to me in detail how to detect if the .AFF deals with chrs or numbers? Also, could you provide the troubling dictionary for me to analyse and test? I have been swamped with work and only in January I will be on vacation, but I will try to do my best! Thanks! Kind regards from your friend, Marco A.G.Pinto -- -- -- Comprehensive Server Monitoring with Site24x7. Monitor 10 servers for $9/Month. Get alerted through email, SMS, voice calls or mobile push notifications. Take corrective actions from your mobile device. http://pubads.g.doubleclick.net/gampad/clk?id=154624111iu=/4140/ostg.clktrk___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Proofing Tool GUI - Icelandic + Galician
Ruud, I have downloaded Dutch from Mozilla and the continuation flags are very complex and hard to understand. I have also downloaded Galician from Mozilla and it seems a bit easier... But, in Galician, what is a: *d\/v* ; it uses \ and / also, I haven't seen in ASCII yet, but the space separating the words and the tags are chr 32 or different?: *da/10 po:contracción* In Portuguese they used a TAB chr or such... if it is not a space and I change to a tab chr, will it still work? :-P Thanks for your time! Kind regards, Marco A.G.Pinto -- On 15/11/2014 14:14, R.J. Baars wrote: In the AFFIX file, default flag is just 1 char. When the clause FLAG num is in the file, the flags are numbers in the 2-byte range, from 1 upt to 65535, separated by a comma (1,2,3,4,555) When the clause FLAG long is in the file, the flags are two chars long, which also translates into 2 bytes internatlly (just plains ascii chars allowed), but there is no separation. Flags might be like Word/AaBbCcDD , actually the flags Aa and Bb etc. So I guess it might be best to use a double byte internally, translating all flags to that. Then there are multiple compounding methods. One using COMPOUNDRULE (mostly for numbers etc, very systematic patterns) COMPOUNDRULE 39 COMPOUNDRULE (N1)(n2) # eenen+zestig[ste] COMPOUNDRULE (G1)*(Le) # 1e - e ... # general compounding, for normal compounding mechanisms. Beware: prefixes only apply to 'first' and suffices to 'last' COMPOUNDBEGIN Ca COMPOUNDMIDDLE Cb COMPOUNDEND Cc COMPOUNDPERMITFLAG Cp ONLYINCOMPOUND Cx The other mechanism is using continuations flags: SFX CA Y 2 SFX CA 0 /CaCp SFX CA 0 -/CaCp (Flag CA creates the option to add words with flag Ca, eiterh wth a - in front of it, or not) You could reduce some ot the complexity by ignoring all 'filters, like 'checkcompoundpattern, onlyincompound etc', because after generating the huge outcom, you could apply Hunspell with the same dic and affixe with the command line option -G te get the correct words from the entire list. The hunspell used had better be 1.3 then, since 1.2* has a bug, suggestion mistaken compound words. Does this help? Ruud Hello, I didn't go to work today because I was not feeling well. I have decided to dedicate some time today to improve PTG in order for it also to unmunch .DICs with numbers instead of characters. Daniel and Ruud, could you explain to me in detail how to detect if the .AFF deals with chrs or numbers? Also, could you provide the troubling dictionary for me to analyse and test? I have been swamped with work and only in January I will be on vacation, but I will try to do my best! Thanks! Kind regards from your friend, Marco A.G.Pinto -- -- -- Comprehensive Server Monitoring with Site24x7. Monitor 10 servers for $9/Month. Get alerted through email, SMS, voice calls or mobile push notifications. Take corrective actions from your mobile device. http://pubads.g.doubleclick.net/gampad/clk?id=154624111iu=/4140/ostg.clktrk___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel -- Comprehensive Server Monitoring with Site24x7. Monitor 10 servers for $9/Month. Get alerted through email, SMS, voice calls or mobile push notifications. Take corrective actions from your mobile device. http://pubads.g.doubleclick.net/gampad/clk?id=154624111iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel -- -- Comprehensive Server Monitoring with Site24x7. Monitor 10 servers for $9/Month. Get alerted through email, SMS, voice calls or mobile push notifications. Take corrective actions from your mobile device. http://pubads.g.doubleclick.net/gampad/clk?id=154624111iu=/4140/ostg.clktrk___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: LanguageTool nightly diff test
Hello! What does it mean?: -Portuguese: 2909 total matches +Portuguese: 2913 total matches The reason why it got hits is because in pt_BR (Portuguese Brazilian) the guys use equipe and we in Portugal (pt_PT) use equipa. equipa means team, just like your soccer team. In pt_PT, equipe is a verb. The verb equipar. Por favor equipe a sua equipa. Kind regards, Marco A.G.Pinto -- On 29/10/2014 22:04, dna...@users.sourceforge.net wrote: LanguageTool Nightly Diff Overview 2014-10-29 22:20 This page lists the results of our automatic nightly testing against a fixed Wikipedia and Tatoeba corpus with 4 sentences per language. Changes 2014-10-28 22:20 to 2014-10-29 22:20 Version: 2.8-SNAPSHOT (2014-10-29 22:01) Not changed: en Not changed: de Not changed: fr Not changed: ru [1]Changed: br [2]Changed: ca Not changed: pl Not changed: it [3]Changed: pt Not changed: es Not changed: nl Not changed: eo Total runtime: 2014-10-29 22:20 to 2014-10-29 23:04 References 1. http://languagetool.org/regression-tests/20141029/result_br_20141029.html 2. http://languagetool.org/regression-tests/20141029/result_ca_20141029.html 3. http://languagetool.org/regression-tests/20141029/result_pt_20141029.html -- -- -- ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Help with unmunch and Icelandic + Galician
Dear Ruud, To see if it is a tool bug or a rule bug, just edit the word(s) in the Dictionary tab of my tool and it will show a tab containing each rule that generates the derivates. You can edit the words with a double-click or with right-click+EDIT. :-P I am feeling so eager! PS-You must use V3.0 (beta) of Proofing Tool GUI since V2.x is outdated. I wanted to release an official 3.0 but I want to make several improvements to it and I will only have free time in January. Thanks! Kind regards from your friend, Marco A.G.Pinto --- On 27/10/2014 20:02, R.J. Baars wrote: In the output of the tool are also unmunch errors. Ab0 as the derivative if Abel e.g. After exporting and processing into a words list, out of the 2.7 Mb, 2.3 Mb was accepted as a correct word by the same spellchecker. So the 'bag of trics' might still be useful after unmunching using this tool, to collect suggestions and add them to the list, for another round of spell checking. (Hunspell suggests words that are not accepted by it...) Ruud -- -- ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Help with unmunch and Icelandic + Galician
Hello! The Tags is the extra information which I only found in the pt_PT dictionary, which tells if each word is masculine, feminine, singular, plural, etc. As for the rules, they are here: For each wrong result you get in the extracted list, you can check here if it is a rule or a tool bug. Thanks! Kind regards, Marco A.G.Pinto -- On 27/10/2014 20:46, R.J. Baars wrote: The first thing I notice is that flags and word are not separated on the screen. I added a picture to show that. When I click edit, it is the same. The / is apparently not seen as a flag indicator in the dictionary. In the dic, you can find flags after the / , comments after # and extra data after a tab (like yy:xxx, which can be ignored for spelling purposes; it is word-type info, a bit like postags) The data that appears after clicking the edit is unclear to me; I don't understand what it is supposed to mean, even though I have quite good knowledge of the Hunspell format and options. Ruud -- -- ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Compound words - en_GB
Hello! I was wondering if someone could extract the 5000+ compound words from the wordlist of my forked en_GB and create a list to be used by LT? (just like we have for pt_PT) (or is that list already created but with 1000s of missing words?) Thanks! Kind regards, Marco A.G.Pinto -- -- -- ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Checkout Failed
Hello! After long minutes downloading files with a checkout in Tortoise, I get a: The XML response contains invalid XML: Is there a way of fixing it? Thanks! Kind regards, Marco A.G.Pinto --- -- -- ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Compound words - en_GB
Yes, Daniel, The en_GB wordlist is directly located here: http://marcoagpinto.cidadevirtual.pt/wordlist_marcoagpinto_20141001_138348w.txt One could create an application that would read line by line and create a new file with each word that has an hyphen in it. That file could be added as a compounds.txt for en_GB just like we have one for pt_PT. Does this make sense? Thanks! Kind regards from your friend, Marco A.G.Pinto -- On 23/10/2014 16:23, Daniel Naber wrote: On 2014-10-23 09:55, Marco A.G.Pinto wrote: I was wondering if someone could extract the 5000+ compound words from the wordlist of my forked en_GB and create a list to be used by LT? Could you give an example? Regards Daniel -- -- ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
British English: -IZE/-ISE
Hello! Since some British people complain that they prefer -ISE, others prefer -IZE and others both, I was wondering if we could add a setting to LT regarding that. This way, one en_GB dictionary would satisfy everyone, and we would rely on LT to do the job. Is it a good idea? Thanks! Kind regards, Marco A.G.Pinto -- -- -- Comprehensive Server Monitoring with Site24x7. Monitor 10 servers for $9/Month. Get alerted through email, SMS, voice calls or mobile push notifications. Take corrective actions from your mobile device. http://p.sf.net/sfu/Zoho___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
British dictionary updated - 1-OCT-2014
Hello! I have updated the en_GB dictionary with 824 new words (making a total of over 6000 since I grabbed the project last year). It can be downloaded from: https://github.com/marcoagpinto/aoo-mozilla-en-dict Thanks! Kind regards, Marco A.G.Pinto -- -- -- Meet PCI DSS 3.0 Compliance Requirements with EventLog Analyzer Achieve PCI DSS 3.0 Compliant Status with Out-of-the-box PCI DSS Reports Are you Audit-Ready for PCI DSS 3.0 Compliance? Download White paper Comply to PCI DSS 3.0 Requirement 10 and 11.5 with EventLog Analyzer http://pubads.g.doubleclick.net/gampad/clk?id=154622311iu=/4140/ostg.clktrk___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Fwd: English dictionaries updated - 2014-09-01
Hello Kevin, Daniel and Mailing list! I have updated en_GB and also added the most recent en_US and en_CA to Apache OpenOffice extension moments ago. :-P Kind regards, Marco A.G.Pinto - Forwarded Message Subject:English dictionaries updated - 2014-09-01 Date: Mon, 01 Sep 2014 00:43:23 +0100 From: Marco A.G.Pinto marcoagpi...@mail.telepac.pt Reply-To: marcoagpi...@mail.telepac.pt To: Mailing List - AOO - Development d...@openoffice.apache.org, Mailing List - AOO - l10n l...@openoffice.apache.org, Mailing List - AOO - QA q...@openoffice.apache.org Hello! Just to inform everyone that I have updated the English dictionaries: - en_US (Kevin) - en_CA (Kevin) - en_GB (Marco) en_GB 2.16 has around 2000 new words since 2.14 (included in the previous extension). You can download it from: http://extensions.openoffice.org/en/project/english-dictionaries-apache-openoffice The GitHub where the wordlists are kept is: https://github.com/marcoagpinto/aoo-mozilla-en-dict My Proofing Tool GUI site project: http://marcoagpinto.cidadevirtual.pt/proofingtoolgui.html Thanks for your time! Kind regards, Marco A.G.Pinto -- -- -- Slashdot TV. Video for Nerds. Stuff that matters. http://tv.slashdot.org/___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Suggesting corrections for some English acronyms
That is done by Hunspell (the dictionary). When a word has a least one letter in upper case in the .DIC and the user places the word differently, it suggests to change it. Kind regards, Marco A.G.Pinto -- On 28/08/2014 13:53, Robin Dunn wrote: Hi, How does LT deal with acronyms? I'd like to identify English acronyms which have been written in lowercase but should be uppercase. I can see that LT does offer corrections for some acronyms e.g. uk UK, unicef UNICEF but corrections are not suggested for the following: uae UAE United Arab Emirates. The regeneration of Dubai has led to a surge in tourism throughout the uae. mp MP Member of Parliament The mp was one of the Britain's most notorious politicians. hazmat HAZMAT hazardous material The hazmat driver was working all day. What would be the best way to add these? Thanks Robin. -- Slashdot TV. Video for Nerds. Stuff that matters. http://tv.slashdot.org/ ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel -- -- Slashdot TV. Video for Nerds. Stuff that matters. http://tv.slashdot.org/___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Compounds - possible workaround for pt_PT pre+post reform
Hello! I believe I have found a way of using pre and post reform compounds. Please take a look at this screenshot: By having that checkbox selected, it uses: *pre-reform-compounds.txt* Can't we just add another checkbox to use: *post-reform-compounds.txt* ? This is a workaround, but the best would be for mother tongue to allow to select pre and post. Thanks! Kind regards, Marco A.G.Pinto -- -- -- ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
en_GB - Dictionary - 2.15 - 1-AUG-2014
Hello! Here is the monthly update. Since I was in vacation for almost three weeks, I have been able to add 1037 words. This makes a total of nearly 5000 new words since I embraced the project a year ago. The files can be downloaded from: https://github.com/marcoagpinto/aoo-mozilla-en-dict Thanks! Kind regards, Marco A.G.Pinto --- -- -- Infragistics Professional Build stunning WinForms apps today! Reboot your WinForms applications with our WinForms controls. Build a bridge from your legacy apps to the future. http://pubads.g.doubleclick.net/gampad/clk?id=153845071iu=/4140/ostg.clktrk___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel