Re: LT2.8 not working on LO4.1.6.2
From: Daniel Naber daniel.na...@languagetool.org Subject: Re: LT2.8 not working on LO4.1.6.2 On 2015-01-09 09:49, Kumara Bhikkhu wrote: I've just installed LT2.8 on LO4.1.6.2 and it doesn't work. No squiggly lines under issues detected by LT2.7. I cannot reproduce this with LO 4.1.3.2, so I can only point to https://languagetool.org/issues/. Installation problems are not uncommon, but 95% of them can be solved with the hints on that page. It's ok. It's working now. Not sure what happened earlier. kb -- New Year. New Location. New Benefits. New Data Center in Ashburn, VA. GigeNET is offering a free month of service with a new server in Ashburn. Choose from 2 high performing configs, both with 100TB of bandwidth. Higher redundancy.Lower latency.Increased capacity.Completely compliant. http://p.sf.net/sfu/gigenet ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
LT2.8 not working on LO4.1.6.2
I've just installed LT2.8 on LO4.1.6.2 and it doesn't work. No squiggly lines under issues detected by LT2.7. FYI, I've not upgraded from LO4.1.6.2 because of a regression bug. kb -- Dive into the World of Parallel Programming! The Go Parallel Website, sponsored by Intel and developed in partnership with Slashdot Media, is your hub for all things parallel software development, from weekly thought leadership blogs to news, videos, case studies, tutorials and more. Take a look and join the conversation now. http://goparallel.sourceforge.net ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Some might say... Error
LT flags say and suggests says and saith. kb -- Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server from Actuate! Instantly Supercharge Your Business Reports and Dashboards with Interactivity, Sharing, Native Excel Exports, App Integration more Get technology previously reserved for billion-dollar corporations, FREE http://pubads.g.doubleclick.net/gampad/clk?id=157005751iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Some might say... Error
Correction: The issue only comes up if the string continues with that: Some might say that Kumara Bhikkhu wrote thus at 05:11 PM 01-12-14: LT flags say and suggests says and saith. kb -- Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server from Actuate! Instantly Supercharge Your Business Reports and Dashboards with Interactivity, Sharing, Native Excel Exports, App Integration more Get technology previously reserved for billion-dollar corporations, FREE http://pubads.g.doubleclick.net/gampad/clk?id=157005751iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Transitive-Intransitive
There's postag for transitive and intransitive, is there? kb -- Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server from Actuate! Instantly Supercharge Your Business Reports and Dashboards with Interactivity, Sharing, Native Excel Exports, App Integration more Get technology previously reserved for billion-dollar corporations, FREE http://pubads.g.doubleclick.net/gampad/clk?id=157005751iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Once (false alarm)
Once, I noticed tension on my head. I've looked up the explanation the rule linked to. Don't seem to apply here at all. kb -- Comprehensive Server Monitoring with Site24x7. Monitor 10 servers for $9/Month. Get alerted through email, SMS, voice calls or mobile push notifications. Take corrective actions from your mobile device. http://p.sf.net/sfu/Zoho___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
I've a
Hi, everyone! I wonder if this is a bit too nitpicky: Emacs! I googled I've a, and found it quite common: * I've a flat tyre. * I've a message from the Lord, hallelujah! * I've a Laddie in America. * Bridget Christie: 'I've a long way to go' | Culture | The Guardian * Lucy Mangan: I've a shameful confession to make * I've an idea. To me, it's just how a casual verbal expression is accurately reproduced in writing. I agree though that this may be deemed inappropriate in scholarly writing. kb -- Want fast and easy access to all the code in your enterprise? Index and search up to 200,000 lines of code with a free copy of Black Duck Code Sight - the same software that powers the world's largest code search on Ohloh, the Black Duck Open Hub! Try it now. http://p.sf.net/sfu/bds___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
out of the way
Need exception here: Emacs! kb -- Infragistics Professional Build stunning WinForms apps today! Reboot your WinForms applications with our WinForms controls. Build a bridge from your legacy apps to the future. http://pubads.g.doubleclick.net/gampad/clk?id=153845071iu=/4140/ostg.clktrk___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: help with English style rules
Marcin MiÅkowski wrote thus at 04:47 PM 20-07-14: W dniu 2014-07-17 06:12, Kumara Bhikkhu pisze: Excellent answers. I'm no native speaker, but hope you don't mind me adding. Perhaps the only place where LT could *suggest* a comma after for example is when it begins the sentence. I think there will be also genuine cases of a missing comma whenever for example is followed by a noun or an adjective. That'll still be a question of style, right? E.g.: Not all birds can fly, for example ducks. kb -- Want fast and easy access to all the code in your enterprise? Index and search up to 200,000 lines of code with a free copy of Black Duck Code Sight - the same software that powers the world's largest code search on Ohloh, the Black Duck Open Hub! Try it now. http://p.sf.net/sfu/bds ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: English native speaker help
Marcin MiÅkowski wrote thus at 05:07 PM 20-07-14: I made this rule a long time ago. Now, I made it off by default and added a URL for an explanation. Some style guides consider this incorrect, so I leave the rule in the file for users that might need it. A very reasonable decision indeed for rules that experts disagree. Joseph Williams makes good sense about this issue of Correctness in his book STYLE: Ten Lessons in Clarity Grace. kb -- Want fast and easy access to all the code in your enterprise? Index and search up to 200,000 lines of code with a free copy of Black Duck Code Sight - the same software that powers the world's largest code search on Ohloh, the Black Duck Open Hub! Try it now. http://p.sf.net/sfu/bds___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
RE: English native speaker help
Mike Unwalla wrote thus at 11:58 PM 18-07-14: The hearing is being rushed because the principle is going out of town. LT: This word is normally spelled with hyphen: out-of-town False alarm. 'Out-of-town' as an adjective is fine. Example, The new out-of-town shopping center is very popular. In this sentence, 'out of town' is not an adjective. Perhaps it'll help that the rule specifies a following noun. Btw, Mike's grasp of English is excellent. kb -- Want fast and easy access to all the code in your enterprise? Index and search up to 200,000 lines of code with a free copy of Black Duck Code Sight - the same software that powers the world's largest code search on Ohloh, the Black Duck Open Hub! Try it now. http://p.sf.net/sfu/bds ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
RE: help with English style rules
Excellent answers. I'm no native speaker, but hope you don't mind me adding. Perhaps the only place where LT could *suggest* a comma after for example is when it begins the sentence. kb Mike Unwalla wrote thus at 02:13 AM 17-07-14: I agree that 'for example' does not have to be followed by a comma. Sometimes, 'for example' is at the end of a sentence. You can write this, for example. Regards, Mike Unwalla Contact: www.techscribe.co.uk/techw/contact.htm -Original Message- From: Robin Dunn [mailto:rd...@iparadigms.com] Sent: 16 July 2014 17:35 To: development discussion for LanguageTool Subject: Re: help with English style rules Hi, I am a native English speaker and I agree with the answer at the following link which advises 'for example' does not always have to be followed by a comma. http://english.stackexchange.com/questions/132359/any-exception-with-commas-before-and-after-for-example From the link here's an example of a valid case where a comma is not required after 'for example': While it is common practice to do recalibration between trials, for example in reading research, this is not always possible or feasible. I'm not aware of a complete comprehensive English grammar guide which is freely available online but I think english.stackexchange.com is an excellent resource for these kinds of questions. Regards Robin On Wed, Jul 16, 2014 at 5:04 PM, Daniel Naber daniel.na...@languagetool.org wrote: Hi, someone suggests that for example should always be followed by a comma: https://github.com/languagetool-org/languagetool/issues/136 My question to the English native speakers: 1.) Do you agree with this? 2.) Is there an established style guide that we can use as a reference for questions like these? If possible, it should be available on the internet for free, and it should also be comprehensive enough to answer most questions. Regards Daniel -- Want fast and easy access to all the code in your enterprise? Index and search up to 200,000 lines of code with a free copy of Black Duck Code Sight - the same software that powers the world's largest code search on Ohloh, the Black Duck Open Hub! Try it now. http://p.sf.net/sfu/bds ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel -- Want fast and easy access to all the code in your enterprise? Index and search up to 200,000 lines of code with a free copy of Black Duck Code Sight - the same software that powers the world's largest code search on Ohloh, the Black Duck Open Hub! Try it now. http://p.sf.net/sfu/bds ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
AftertheDeadline
I recall noticing some useful style suggestions when I was using AftertheDeadline. Has all the rules there been integrated into LT? kb -- Want fast and easy access to all the code in your enterprise? Index and search up to 200,000 lines of code with a free copy of Black Duck Code Sight - the same software that powers the world's largest code search on Ohloh, the Black Duck Open Hub! Try it now. http://p.sf.net/sfu/bds ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: These|Those + Singular Noun
Marcin MiÅkowski wrote thus at 03:45 PM 01-06-14: The error is found also online using our form at http://www.languagetool.org Yes, I found out. (I used http://community.languagetool.org/homepage/simpleCheck?lang=en) Maybe it could be useful for you to use the form first. Anyway, if there's no match in your installation, it could mean that you disabled the rule in OO/LO. Indeed it's disabled, but not by me. Checking further, I notice that some other rules that were enabled have also been disabled. Strange. kb -- Time is money. Stop wasting it! Get your web API in 5 minutes. www.restlet.com/download http://p.sf.net/sfu/restlet ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: These|Those + Singular Noun
Marcin MiÅkowski wrote thus at 07:31 PM 30-05-14: Hm, there's already THIS_NNS[2] rule that finds these|those + singular noun. Is there any mistake that it does not find? It definitely detects the mistake as specified in your example above. You could have told me earlier, you know? Never mind. My analytical side of the brain probably needs some workout. Here's what it doesn't catch: I find those translation misleading. I've just installed the latest snapshot to be sure. kb -- Time is money. Stop wasting it! Get your web API in 5 minutes. www.restlet.com/download http://p.sf.net/sfu/restlet___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: These|Those + Singular Noun
Marcin MiÅkowski wrote thus at 04:00 PM 31-05-14: W dniu 2014-05-31 08:29, Kumara Bhikkhu pisze: Here's what it doesn't catch: *I find _those translation_ misleading. It does: 1.) Line 1, column 8, Rule ID: THIS_NNS[2] Message: Did you mean 'this translation' or 'those translations'? Suggestion: this translation; those translations I find those translation misleading. ^ *I've just installed the latest snapshot to be sure. Don't you get a match as above? Nope, but pardon me, my latest is LanguageTool-20140529-snapshot.oxt. Now 2 days old. Perhaps you made some changes recently. (FYI, I don't have good connection where I am, and rely on others to get large files.) kb -- Time is money. Stop wasting it! Get your web API in 5 minutes. www.restlet.com/download http://p.sf.net/sfu/restlet ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Old versions of LT remains in LO user profile
I notice that remnants of old versions remains in LO's user profile. Is this LT's bug (or feature?) or LO's? kb -- Time is money. Stop wasting it! Get your web API in 5 minutes. www.restlet.com/download http://p.sf.net/sfu/restlet ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Old versions of LT remains in LO user profile
Oh... I wanted to ask in what way is it a feature, but never mind. I notice that the remnants for official releases of LT are negligible, while the snapshots are taking about 76MB each! kb Dmitri Gabinski wrote thus at 07:33 PM 31-05-14: When upgrading an existing version of LT, first uninstall a previous one, close and install a new one. This should prevent the remnants. And yes, this is a feature of OpenOffice. Best regards, Dmitri Gabinski 31.05.2014 14:03 полÑзоваÑÐµÐ»Ñ Kumara Bhikkhu mailto:kumara.bhik...@gmail.comkumara.bhik...@gmail.com напиÑал: I notice that remnants of old versions remains in LO's user profile. Is this LT's bug (or feature?) or LO's? kb -- Time is money. Stop wasting it! Get your web API in 5 minutes. www.restlet.com/download http://p.sf.net/sfu/restlet___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: These|Those + Singular Noun
Daniel Naber wrote thus at 03:26 PM 30-05-14: On 2014-05-30 03:47, Kumara Bhikkhu wrote: You cannot use the online editor to get the singular form of a plural word in the suggestion. OK. Got it. How do I do it manually then? I meant how do I indicate in the XML to suggest the singular form of a plural word, or vice versa? (Sorry if this is already in some help text that I can't find.) kb -- Time is money. Stop wasting it! Get your web API in 5 minutes. www.restlet.com/download http://p.sf.net/sfu/restlet ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: These|Those + Singular Noun
Kumara Bhikkhu wrote thus at 04:28 PM 30-05-14: I meant how do I indicate in the XML to suggest the singular form of a plural word, or vice versa? Never mind. I've got it by looking at existing rules. Had to split to 2 rules though: one for these and one for those. rule id=ID name=These + Singular Noun pattern tokenthese/token token postag='NN|NN:UN' postag_regexp='yes'exception postag='IN|VBP' postag_regexp='yes'/exception chunk=E-NP-singular//token /pattern message\1 and \2 do not match grammatically./message suggestion\1 match no=2 postag=NNS//suggestion suggestionthis \2/suggestion shortGrammar/short example type='incorrect'I find markerthese translation/marker misleading./example example type='correct'I find these translations misleading./example example type='correct'I find this translation misleading./example /rule rule id=ID name=Those + Singular Noun pattern tokenthose/token token postag='NN|NN:UN' postag_regexp='yes'exception postag='IN|VBP' postag_regexp='yes'/exception chunk=E-NP-singular//token /pattern message\1 and \2 do not match grammatically./message suggestion\1 match no=2 postag=NNS//suggestion suggestionthat \2/suggestion shortGrammar/short example type='incorrect'I find markerthose translation/marker misleading./example example type='correct'I find those translations misleading./example example type='correct'I find that translation misleading./example /rule -- Time is money. Stop wasting it! Get your web API in 5 minutes. www.restlet.com/download http://p.sf.net/sfu/restlet___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: These|Those + Singular Noun
Daniel Naber wrote thus at 08:03 PM 29-05-14: You cannot use the online editor to get the singular form of a plural word in the suggestion. OK. Got it. How do I do it manually then? kb -- Time is money. Stop wasting it! Get your web API in 5 minutes. www.restlet.com/download http://p.sf.net/sfu/restlet ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Also without comma
Current LT flags sentences beginning with Also without a comma, and suggest adding a comma. I think and exception should be made when the following word is a verb. E.g.: Also specify your gender. kb -- Time is money. Stop wasting it! Get your web API in 5 minutes. www.restlet.com/download http://p.sf.net/sfu/restlet ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: possible new English rule
Looks useful. Like to suggest a change of name though. Your message says a/the + infinitive, while the name suggest a mere typo. How about a|an|the + infinitive for the name? You could also leave out the an argue in. kb Jaume Ortolà i Font wrote thus at 07:46 PM 28-05-14: Could it be a useful rule? !-- English rule, 2014-05-28 -- rule id=ID name=a compete/complete  pattern  token regexp='yes'a|an|the/token  token postag='VB|VBP' postag_regexp='yes'exception postag='VB|VBP' postag_regexp='yes' negate_pos='yes'/exception/token  /pattern  messageProbably a bad construction: a/the + infinitive/message  example type='incorrect'markera compete/marker catastrophe an argue in/example  example type='correct'a complete catastrophe/example  example type='correct'a show/example /rule Regards, Jaume Ortolà -- Time is money. Stop wasting it! Get your web API in 5 minutes. www.restlet.com/download http://p.sf.net/sfu/restlet ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel -- Time is money. Stop wasting it! Get your web API in 5 minutes. www.restlet.com/download http://p.sf.net/sfu/restlet ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: I've create a rule.
Daniel Naber wrote thus at 02:57 PM 22-05-14: On 2014-05-22 08:46, Kumara Bhikkhu wrote: Problem is I don't know how to look for the rule. Perhaps the one who created it can do that? You can put the sentence into the field at languagetool.org, check the text and click the error. There's an item Rule implementation that will lead you to a page where you can click Show XML or directly go to the new rule editor. Thanks. I've tried. Looks too complicated for me! kb -- The best possible search technologies are now affordable for all companies. Download your FREE open source Enterprise Search Engine today! Our experts will assist you in its installation for $59/mo, no commitment. Test it for FREE on our Cloud platform anytime! http://pubads.g.doubleclick.net/gampad/clk?id=145328191iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: I've create a rule.
Daniel Naber wrote thus at 07:38 PM 20-05-14: On 2014-05-20 10:23, Kumara Bhikkhu wrote: For this sentence, LT flags create, but the error may be the ve. The same is true for I'll created a rule. Should this rule be improved upon? That would be nice. English tenses are very difficult for non-native speakers, so more suggestions could be useful, as long as they are accompanied by a helpful description of the semantics (well, maybe a URL pointing to a web page might be best, as tenses probably cannot be explained in a short message). Problem is I don't know how to look for the rule. Perhaps the one who created it can do that? kb -- Accelerate Dev Cycles with Automated Cross-Browser Testing - For FREE Instantly run your Selenium tests across 300+ browser/OS combos. Get unparalleled scalability from the best Selenium testing platform available Simple to use. Nothing to install. Get started now for free. http://p.sf.net/sfu/SauceLabs ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
I've create a rule.
For this sentence, LT flags create, but the error may be the ve. The same is true for I'll created a rule. Should this rule be improved upon? kb -- Accelerate Dev Cycles with Automated Cross-Browser Testing - For FREE Instantly run your Selenium tests across 300+ browser/OS combos. Get unparalleled scalability from the best Selenium testing platform available Simple to use. Nothing to install. Get started now for free. http://p.sf.net/sfu/SauceLabs ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: New rule for English
It's true there's no mistake. You can consider this an optional rule for concision freaks like me. Besides, in the modern world of language style, there's a trend away from wordiness common in classical English. (The particular structure is necessary in some circumstances though.) Btw, I don't follow Strunk and White. More in line with Joseph M. Williams. You might be interested in his book: Style: Lessons in Clarity and Grace. kb Marcin MiÅkowski wrote thus at 03:06 PM 19-05-14: W dniu 2014-05-19 05:21, Kumara Bhikkhu pisze: Please consider adding this. I'm unable to test it due to the and. Well, I don't see any mistake being detected here. It was/is... that is a way to express stress on some facts in the statement. This is perfect English and you could probably find Jane Austin or Charles Dickens using such constructions. Therefore, I'm not really sure if we need to be stricter than good writers. This belongs to the deplorable tradition of nit-picking absurd advice such as Strunk and White disingenuous 'The Elements of Style', which criticized perfect English (such as 'split infinitive' that has never been a mistake in reality). Regards, Marcin rule id=It was/is... that name=It was/is... that pattern tokenIt/token token regexp='yes'was|is/token and token chunk=B-NP-singular min=0/ token postag=PRP$|PDT|POS postag_regexp=yes/ /and token chunk=I-NP-singular min=0 max=-1/ token chunk=E-NP-singular/ tokenthat/token /pattern messageYou may want to make this concise./message suggestion\3 \4 \5/suggestion shortWordiness/short example type='incorrect'markerIt was her last argument that/marker finally persuaded me./example example type='correct'Her last argument finally persuaded me./example /rule -- Accelerate Dev Cycles with Automated Cross-Browser Testing - For FREE Instantly run your Selenium tests across 300+ browser/OS combos. Get unparalleled scalability from the best Selenium testing platform available Simple to use. Nothing to install. Get started now for free. http://p.sf.net/sfu/SauceLabs ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel -- Accelerate Dev Cycles with Automated Cross-Browser Testing - For FREE Instantly run your Selenium tests across 300+ browser/OS combos. Get unparalleled scalability from the best Selenium testing platform available Simple to use. Nothing to install. Get started now for free. http://p.sf.net/sfu/SauceLabs ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel -- Accelerate Dev Cycles with Automated Cross-Browser Testing - For FREE Instantly run your Selenium tests across 300+ browser/OS combos. Get unparalleled scalability from the best Selenium testing platform available Simple to use. Nothing to install. Get started now for free. http://p.sf.net/sfu/SauceLabs___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
noun phrase one or multiple words long
Can someone tell me how I can create a pattern that can match a noun phrase that can be one or multiple words long? kb -- Accelerate Dev Cycles with Automated Cross-Browser Testing - For FREE Instantly run your Selenium tests across 300+ browser/OS combos. Get unparalleled scalability from the best Selenium testing platform available Simple to use. Nothing to install. Get started now for free. http://p.sf.net/sfu/SauceLabs ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
New rule for English
Please consider adding this. I'm unable to test it due to the and. rule id=It was/is... that name=It was/is... that pattern tokenIt/token token regexp='yes'was|is/token and token chunk=B-NP-singular min=0/ token postag=PRP$|PDT|POS postag_regexp=yes/ /and token chunk=I-NP-singular min=0 max=-1/ token chunk=E-NP-singular/ tokenthat/token /pattern messageYou may want to make this concise./message suggestion\3 \4 \5/suggestion shortWordiness/short example type='incorrect'markerIt was her last argument that/marker finally persuaded me./example example type='correct'Her last argument finally persuaded me./example /rule -- Accelerate Dev Cycles with Automated Cross-Browser Testing - For FREE Instantly run your Selenium tests across 300+ browser/OS combos. Get unparalleled scalability from the best Selenium testing platform available Simple to use. Nothing to install. Get started now for free. http://p.sf.net/sfu/SauceLabs ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
False flag: I had to say no to them
Found a false flag. In this sentence: I had to say no to them. LT flags no, saying it probably should be now. kb -- Accelerate Dev Cycles with Automated Cross-Browser Testing - For FREE Instantly run your Selenium tests across 300+ browser/OS combos. Get unparalleled scalability from the best Selenium testing platform available Simple to use. Nothing to install. Get started now for free. http://p.sf.net/sfu/SauceLabs___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Readability: sentence over 40 words
Thanks. Some more... (AN10.72) is consider as 5 words by the rule. You can use this (36 words plus that string) to test: One Two Three Four Five Six Seven Eight Nine Ten One Two Three Four Five Six Seven Eight Nine Ten One Two Three Four Five Six Seven Eight Nine Ten One Two Three Four Five Six (AN10.72). kb Daniel Naber wrote thus at 04:35 PM 03-05-14: On 2014-05-03 05:40, Kumara Bhikkhu wrote: However there remains another bug with word counting for Readability: sentence over 40 words: It counts all quote marks (single, double, normal, curly) as words too. Thanks, that should be fixed now. Regards Daniel -- Accelerate Dev Cycles with Automated Cross-Browser Testing - For FREE Instantly run your Selenium tests across 300+ browser/OS combos. Get unparalleled scalability from the best Selenium testing platform available. Simple to use. Nothing to install. Get started now for free. http://p.sf.net/sfu/SauceLabs ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel -- Is your legacy SCM system holding you back? Join Perforce May 7 to find out: #149; 3 signs your SCM is hindering your productivity #149; Requirements for releasing software faster #149; Expert tips and advice for migrating your SCM now http://p.sf.net/sfu/perforce___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Readability: sentence over 40 words
Daniel Naber wrote thus at 08:07 PM 05-05-14: On 2014-05-05 09:47, Kumara Bhikkhu wrote: Thanks. Some more... _(AN10.72)_ is consider as 5 words by the rule. I have just fixed this. It will still be two words because of the dot, but I guess we can live with that. Yes, I can tolerable that. :-) Thanks! kb -- Is your legacy SCM system holding you back? Join Perforce May 7 to find out: #149; 3 signs your SCM is hindering your productivity #149; Requirements for releasing software faster #149; Expert tips and advice for migrating your SCM now http://p.sf.net/sfu/perforce ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: English words with hyphens
That's reasonable. People can be using hyphens for various reason to combine words. This allowance frees you from having to update LT for every new combination. Daniel Naber wrote thus at 09:32 PM 21-04-14: Hi, replying to an old thread here: I have set English to accept words with hyphens if the parts are valid words. For example, web-based is accepted now. Unusual or wrong hyphenations will also be accepted, but there were just too many false alarms for my taste. Regards Daniel -- Start Your Social Network Today - Download eXo Platform Build your Enterprise Intranet with eXo Platform Software Java Based Open Source Intranet - Social, Extensible, Cloud Ready Get Started Now And Turn Your Intranet Into A Collaboration Platform http://p.sf.net/sfu/ExoPlatform ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: prototype of new rule editor
I love it! Amazing work. Minor issue: Is the A example sentence in the 2 text boxes deliberately wrong? kb Daniel Naber wrote thus at 12:51 AM 18-03-14: Hi, there's now a prototype of a new rule editor available at http://community.languagetool.org/ruleEditor2/. Main features are: * Checks the example sentence against known errors so nobody wastes time writing a rule that already exists * Has text analysis (POS tags, lemmas, chunks) integrated * Checks rule against a part of the Wikipedia/Tatoeba corpus to help avoid false alarms The basic workflow idea is to start with two example sentences, a wrong one and its corrected version. A (trivial) pattern is then generated automatically, which is just the word(s) that differ in the wrong and corrected sentence. The user then needs to add more tokens to make the rule complete. Finally, it is checked against Wikipedia/Tatoeba. Several things are not supported yes, but please give it a try anyway. Regards Daniel -- Learn Graph Databases - Download FREE O'Reilly Book Graph Databases is the definitive new guide to graph databases and their applications. Written by three acclaimed leaders in the field, this first edition is now available. Download your free book today! http://p.sf.net/sfu/13534_NeoTech ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Bug: More or _less_
Marcin MiÅkowski wrote thus at 07:22 PM 08-03-14: Maybe a pattern for inflected plural be + more or less would do it in this case. Indeed, there are multiple cases of this. Yes, but it's safer to create an exception for all cases of more or less. Besides, speaking for descriptive grammar, less is now widely used for both countable and uncountable nouns. Fewer is hardly used at all. with mettâ, Kumâra Bhikkhu -- Subversion Kills Productivity. Get off Subversion Make the Move to Perforce. With Perforce, you get hassle-free workflows. Merge that actually works. Faster operations. Version large binaries. Built-in WAN optimization and the freedom to use Git, Perforce or both. Make the move to Perforce. http://pubads.g.doubleclick.net/gampad/clk?id=122218951iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Bug: More or _less_
Here's where it came from: Others however are more or less combinations of both samatha and... You will find in dictionaries that more or less is an idiom that functions as one adverb. In the above, it means to some degree. In http://idioms.thefreedictionary.com/more+or+less, we find an example: This room is more or less an extra - we don't really need it. kb Marcin MiÅkowski wrote thus at 05:08 PM 08-03-14: There can be a verb or an uncountable noun, or an adverb. In the corpus (GLOWBE) all matches of more or less + plural noun turned out to be verbs misrecognized as nouns. I may be wrong but I think more or less combinations is not standard English. If you can find Charles Dickens or Jane Austin, or anything that is definitely standard English, with this expression, I will add an exception to the rule. Regards, Marcin -- Subversion Kills Productivity. Get off Subversion Make the Move to Perforce. With Perforce, you get hassle-free workflows. Merge that actually works. Faster operations. Version large binaries. Built-in WAN optimization and the freedom to use Git, Perforce or both. Make the move to Perforce. http://pubads.g.doubleclick.net/gampad/clk?id=122218951iu=/4140/ostg.clktrk___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Should probably be uppercase?
Marcin, I've installed LanguageTool-20140301-snapshot.oxt, but forgot to turn off the capitalization option. Here's an example: Emacs! Notice that the d in you'd is also flagged--for the same rule. Also flagged for the same is whomever. kb inline: d4d166.jpg-- Subversion Kills Productivity. Get off Subversion Make the Move to Perforce. With Perforce, you get hassle-free workflows. Merge that actually works. Faster operations. Version large binaries. Built-in WAN optimization and the freedom to use Git, Perforce or both. Make the move to Perforce. http://pubads.g.doubleclick.net/gampad/clk?id=122218951iu=/4140/ostg.clktrk___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: new online feature: text analysis
Daniel Naber wrote thus at 03:47 PM 04-03-14: On 2014-03-04 05:32, Kumara Bhikkhu wrote: Is this a bug? is taken as I-NP-plural In which sentence? We use a statistical approach to find the chunks, so they will not be 100% correct. I see. It's from a title which I pasted just for testing: How to Deal with Negative Thoughts Feelings kb -- Subversion Kills Productivity. Get off Subversion Make the Move to Perforce. With Perforce, you get hassle-free workflows. Merge that actually works. Faster operations. Version large binaries. Built-in WAN optimization and the freedom to use Git, Perforce or both. Make the move to Perforce. http://pubads.g.doubleclick.net/gampad/clk?id=122218951iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Should probably be uppercase?
Marcin MiÅkowski wrote thus at 04:36 PM 03-03-14: Let me think, it seems you activated a rule that I used only to process lowercase-only sentences (after machine translation). The rule is off by default, so you had to put a check mark in the configuration dialog yourself. Frankly, this rule is off by default because it really works only for really garbled input. I will remove it altogether. You mean this one? Emacs! I thought that was dealing with the i in i am only. OK. It's off! Thanks. kb inline: 54e901a.jpg-- Subversion Kills Productivity. Get off Subversion Make the Move to Perforce. With Perforce, you get hassle-free workflows. Merge that actually works. Faster operations. Version large binaries. Built-in WAN optimization and the freedom to use Git, Perforce or both. Make the move to Perforce. http://pubads.g.doubleclick.net/gampad/clk?id=122218951iu=/4140/ostg.clktrk___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Should probably be uppercase?
Since such false alarms are quite frequent (even two is flagged as should probably be uppercase), it makes LT look bad. So, do consider an early public release. kb -- Subversion Kills Productivity. Get off Subversion Make the Move to Perforce. With Perforce, you get hassle-free workflows. Merge that actually works. Faster operations. Version large binaries. Built-in WAN optimization and the freedom to use Git, Perforce or both. Make the move to Perforce. http://pubads.g.doubleclick.net/gampad/clk?id=122218951iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Should probably be uppercase?
Daniel Naber wrote thus at 04:48 PM 01-03-14: I cannot reproduce that. That's good. Meaning it's already fixed in the latest nightly. Does it also happen with the stand-alone version of LT? Are you using the latest version (2.4 or nightly snapshot)? LT2.4 LO ext. kb -- Flow-based real-time traffic analytics software. Cisco certified tool. Monitor traffic, SLAs, QoS, Medianet, WAAS etc. with NetFlow Analyzer Customize your own dashboards, set traffic alerts and generate reports. Network behavioral analysis security monitoring. All-in-one tool. http://pubads.g.doubleclick.net/gampad/clk?id=126839071iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: new rule: check space between sentences
Daniel Naber wrote thus at 06:18 PM 19-02-14: This rule will cause new false alarms for words like X.Org. If it's only when the first character after the dot is in cap, then it's okay. Otherwise, we've a lot to deal with: .com, .exe, .org, .uk, .txt, .net, .my, .tv... kb -- Managing the Performance of Cloud-Based Applications Take advantage of what the Cloud has to offer - Avoid Common Pitfalls. Read the Whitepaper. http://pubads.g.doubleclick.net/gampad/clk?id=121054471iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: new rule: check space between sentences
That seems reasonable. Anyway, time will tell if the current implementation is okay. kb R.J. Baars wrote thus at 10:42 PM 19-02-14: In fact, I suggest only lowercasefull stopUppercase should be reported. Ruud Daniel Naber wrote thus at 06:18 PM 19-02-14: This rule will cause new false alarms for words like X.Org. If it's only when the first character after the dot is in cap, then it's okay. Otherwise, we've a lot to deal with: .com, .exe, .org, .uk, .txt, .net, .my, .tv... kb -- Managing the Performance of Cloud-Based Applications Take advantage of what the Cloud has to offer - Avoid Common Pitfalls. Read the Whitepaper. http://pubads.g.doubleclick.net/gampad/clk?id=121054471iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Unclear to some users that errors in LT main page are made on purpose
The issue is the first sentence is correct, setting the expectation for the rest of the text. Current: Paste your own text here. or check this text too see an few of of the problems that LanguageTool can detecd. What do you thinks of grammer checkers? Please not that they are not perfect. Suggestion: Check this text too see an few of of the problems that LanguageTool can detecd. what do you thinks of grammer checkers? Please not that they are not perfect. For Paste your own text here. I suggest having this below the box: You can also paste your own text for checking. kb Dominique Pellé wrote thus at 08:54 PM 16-02-14: Hi I've received several times already emails from users signaling errors in the main page of LanguageTool in the example sentence. Of course, these are errors done on purpose to show what LanguageTool can find. However, the fact that I received those emails shows that it's unclear to some users. So how about we highlight the mistakes in the example sentences even without having to press Check Text? That would make it clearer that errors are made on purpose. Any other idea? Regards Dominique -- Android apps run on BlackBerry 10 Introducing the new BlackBerry 10.2.1 Runtime for Android apps. Now with support for Jelly Bean, Bluetooth, Mapview and more. Get your Android app in front of a whole new audience. Start now. http://pubads.g.doubleclick.net/gampad/clk?id=124407151iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel -- Android apps run on BlackBerry 10 Introducing the new BlackBerry 10.2.1 Runtime for Android apps. Now with support for Jelly Bean, Bluetooth, Mapview and more. Get your Android app in front of a whole new audience. Start now. http://pubads.g.doubleclick.net/gampad/clk?id=124407151iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Earlier public release?
It still leaves some room for a variety of ideas... What could it mean? The underline words are flagged by LT 2.4. It doesn't look good on LT. If these bugs are fixed, I\d like to suggest an earlier public release. kb -- WatchGuard Dimension instantly turns raw network data into actionable security intelligence. It gives you real-time visual feedback on key security issues and trends. Skip the complicated setup - simply import a virtual appliance and go from zero to informed in seconds. http://pubads.g.doubleclick.net/gampad/clk?id=123612991iu=/4140/ostg.clktrk___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: E-mail I sent became stuck for approval?
I've seen a posting of yours get into my gmail spam box. There might have been others. It may be due to your elaborate image signature. I wonder if you could tone that down, or even do without it for the discussion list. kb Marco A.G.Pinto wrote thus at 02:28 AM 01-02-14: Hello! I sent an e-mail an hour ago and it didn't appear yet. Has it become stuck for approval because of its size? Thanks! Kind regards, Marco A.G.Pinto --- -- -- WatchGuard Dimension instantly turns raw network data into actionable security intelligence. It gives you real-time visual feedback on key security issues and trends. Skip the complicated setup - simply import a virtual appliance and go from zero to informed in seconds. http://pubads.g.doubleclick.net/gampad/clk?id=123612991iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
100 sentences?
From http://community.languagetool.org/ruleEditor/expert We've checked your pattern against 100 sentences from the English Wikipedia and from Tatoeba found no matches. That's a good sign, it means your rule doesn't trigger any false alarms at least in the documents we checked. Just 100 sentences? kb -- WatchGuard Dimension instantly turns raw network data into actionable security intelligence. It gives you real-time visual feedback on key security issues and trends. Skip the complicated setup - simply import a virtual appliance and go from zero to informed in seconds. http://pubads.g.doubleclick.net/gampad/clk?id=123612991iu=/4140/ostg.clktrk___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Big in size, etc
ok. agreed. Marcin MiÅkowski wrote thus at 05:03 PM 30-01-14: W dniu 2014-01-30 04:14, Kumara Bhikkhu pisze: Of what you listed, these are rightly flagged: * few in number * religious in nature * autocratic in nature, Removing the 2nd and 3rd token wouldn't cause any lost of meaning. Well, in general, I'm skeptical about inclusion of in nature. In many instances, it means in fact, which is just a matter of stress (but not redundant or pompous at all). But there is a number of uses where it means by necessity or in essence, which argues that something essentially or necessarily has a certain property. This is proper usage, and your rule would conflate necessity judgments and actuality judgments, which is simply a logical fallacy. Let me quote the passage from the Brown corpus: The men who speculate on these institutions have, for the most part, come to at least one common conclusion: that many of the great enterprises and associations around which our democracy is formed are in themselves autocratic in nature, and possessed of power which can be used to frustrate the citizen who is trying to assert his individuality in the modern world. Here, by saying autocratic, and not autocratic in nature, one would change the logical structure of the passage, and make the point ambiguous. The writer wanted to say that some apparently democratic institutions are autocratic in character, or in their real essence (so they are autocratic necessarily, not contingently). To apply the correction offered by BIG_IN_SIZE to this passage is to distort the logical structure of the passage. I will probably remove 'nature' from the rule altogether. Regards, Marcin kb Marcin Mià kowski wrote thus at 011:33 AM 30-01-14: W dniu 2014-01-29 10:35, Kumara Bhikkhu pisze: Hope this is not too greedy. rule id=BIG_IN_SIZE name=Big in size, etc pattern token postag_regexp=yes postag=JJ|JJR|JJSexception regexp=yessimilar|alone/exceptionexception postag=VBN//token tokenin/token token regexp=yessize|duration|color|colour|number|shape|height|nature|length|weight/token /pattern messageA more concise phrase may lose no meaning and sound more powerful./message suggestion\1/suggestion shortPossible redundancy/short example type=correctThe man is big./example example type=correctHis speech was briefest./example example type=correctHe's absorbed in thought./example example correction=big type=incorrectThe man is markerbig in size/marker./example example correction=briefest type=incorrectHis speech was markerbriefest in duration/marker./example example correction=redder type=incorrectMy car is markerredder in color/marker./example example correction=few type=incorrectHer friends were markerfew in number/marker./example /rule Thanks! Yes, this is much better but we also match: rich in color few in number possible in Nature, corresponding in number religious in nature efficient in shape autocratic in nature, equal in size beautiful in color, next in size Out of 16 matches on Brown corpus, the useful ones were brown in color,tawny in color, two in number. I added the adjectives above as additional exceptions and we'll see how it performs. I'm not sure about 'few'. We'll probably see more matches in the nightly diff today. Regards, Marcin -- WatchGuard Dimension instantly turns raw network data into actionable security intelligence. It gives you real-time visual feedback on key security issues and trends. Skip the complicated setup - simply import a virtual appliance and go from zero to informed in seconds. http://pubads.g.doubleclick.net/gampad/clk?id=123612991iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel -- WatchGuard Dimension instantly turns raw network data into actionable security intelligence. It gives you real-time visual feedback on key security issues and trends. Skip the complicated setup - simply import a virtual appliance and go from zero to informed in seconds. http://pubads.g.doubleclick.net/gampad/clk?id=123612991iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel -- WatchGuard Dimension instantly turns raw network data into actionable security intelligence. It gives you real-time visual feedback on key security issues and trends. Skip the complicated setup - simply
Would this affect LT on LibO? (Fwd: [tdf-announce] LibreOffice 4.2)
Date: Thu, 30 Jan 2014 13:05:04 +0100 From: Italo Vignoli italo.vign...@documentfoundation.org Organization: The Document Foundation snip LibreOffice 4.2 is the first open source suite to ship a new Windows (IAccessible2 based) accessibility feature developed by IBM. This is considered experimental for this release, but will replace legacy Java based accessibility in the next major release. -- WatchGuard Dimension instantly turns raw network data into actionable security intelligence. It gives you real-time visual feedback on key security issues and trends. Skip the complicated setup - simply import a virtual appliance and go from zero to informed in seconds. http://pubads.g.doubleclick.net/gampad/clk?id=123612991iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Discrepancy in user rule filename?
In the Standalone, we have Load Rule File..., which leads to a dialog box asking for a rules*.xml. While this is ignored, it insists that the file name should start with rules. Yet, the guideline in Development for adding a rule file says user-rules.xml Should they agree? kb -- WatchGuard Dimension instantly turns raw network data into actionable security intelligence. It gives you real-time visual feedback on key security issues and trends. Skip the complicated setup - simply import a virtual appliance and go from zero to informed in seconds. http://pubads.g.doubleclick.net/gampad/clk?id=123612991iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
LT @ Libreoffice.org Extension Center
Dear all, Presently, LT is listed at #4 at http://extensions.libreoffice.org/extension-center with 59 likes. If you're using LO with the LT extension, consider adding a like there. FYI, LT is now behind * Clipart gallery of danger signs 88 likes, 1 dislikes * Copy only visible cells 69 likes, 0 dislikes * TexMaths 64 likes, 1 dislikes kb -- WatchGuard Dimension instantly turns raw network data into actionable security intelligence. It gives you real-time visual feedback on key security issues and trends. Skip the complicated setup - simply import a virtual appliance and go from zero to informed in seconds. http://pubads.g.doubleclick.net/gampad/clk?id=123612991iu=/4140/ostg.clktrk___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Big in size, etc
Hope this is not too greedy. rule id=BIG_IN_SIZE name=Big in size, etc pattern token postag_regexp=yes postag=JJ|JJR|JJSexception regexp=yessimilar|alone/exceptionexception postag=VBN//token tokenin/token token regexp=yessize|duration|color|colour|number|shape|height|nature|length|weight/token /pattern messageA more concise phrase may lose no meaning and sound more powerful./message suggestion\1/suggestion shortPossible redundancy/short example type=correctThe man is big./example example type=correctHis speech was briefest./example example type=correctHe's absorbed in thought./example example correction=big type=incorrectThe man is markerbig in size/marker./example example correction=briefest type=incorrectHis speech was markerbriefest in duration/marker./example example correction=redder type=incorrectMy car is markerredder in color/marker./example example correction=few type=incorrectHer friends were markerfew in number/marker./example /rule -- WatchGuard Dimension instantly turns raw network data into actionable security intelligence. It gives you real-time visual feedback on key security issues and trends. Skip the complicated setup - simply import a virtual appliance and go from zero to informed in seconds. http://pubads.g.doubleclick.net/gampad/clk?id=123612991iu=/4140/ostg.clktrk___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
They got it right; they got it wrong.
False alarm: They got it right; they got it wrong. right and wrong are wrongly flagged. kb -- WatchGuard Dimension instantly turns raw network data into actionable security intelligence. It gives you real-time visual feedback on key security issues and trends. Skip the complicated setup - simply import a virtual appliance and go from zero to informed in seconds. http://pubads.g.doubleclick.net/gampad/clk?id=123612991iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Big in size, etc
Of what you listed, these are rightly flagged: * few in number * religious in nature * autocratic in nature, Removing the 2nd and 3rd token wouldn't cause any lost of meaning. kb Marcin MiÅkowski wrote thus at 01:33 AM 30-01-14: W dniu 2014-01-29 10:35, Kumara Bhikkhu pisze: Hope this is not too greedy. rule id=BIG_IN_SIZE name=Big in size, etc pattern token postag_regexp=yes postag=JJ|JJR|JJSexception regexp=yessimilar|alone/exceptionexception postag=VBN//token tokenin/token token regexp=yessize|duration|color|colour|number|shape|height|nature|length|weight/token /pattern messageA more concise phrase may lose no meaning and sound more powerful./message suggestion\1/suggestion shortPossible redundancy/short example type=correctThe man is big./example example type=correctHis speech was briefest./example example type=correctHe's absorbed in thought./example example correction=big type=incorrectThe man is markerbig in size/marker./example example correction=briefest type=incorrectHis speech was markerbriefest in duration/marker./example example correction=redder type=incorrectMy car is markerredder in color/marker./example example correction=few type=incorrectHer friends were markerfew in number/marker./example /rule Thanks! Yes, this is much better but we also match: rich in color few in number possible in Nature, corresponding in number religious in nature efficient in shape autocratic in nature, equal in size beautiful in color, next in size Out of 16 matches on Brown corpus, the useful ones were brown in color,tawny in color, two in number. I added the adjectives above as additional exceptions and we'll see how it performs. I'm not sure about 'few'. We'll probably see more matches in the nightly diff today. Regards, Marcin -- WatchGuard Dimension instantly turns raw network data into actionable security intelligence. It gives you real-time visual feedback on key security issues and trends. Skip the complicated setup - simply import a virtual appliance and go from zero to informed in seconds. http://pubads.g.doubleclick.net/gampad/clk?id=123612991iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel -- WatchGuard Dimension instantly turns raw network data into actionable security intelligence. It gives you real-time visual feedback on key security issues and trends. Skip the complicated setup - simply import a virtual appliance and go from zero to informed in seconds. http://pubads.g.doubleclick.net/gampad/clk?id=123612991iu=/4140/ostg.clktrk___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: homepage usability test
Kumara Bhikkhu wrote thus at 12:52 PM 30-01-14: Daniel Naber wrote thus at 06:34 AM 30-01-14: -sometimes they don't get that the demo text is a demo and thus contains errors on purpose I too had the same experience, though not for the demo text on the old homepage. I think revising the text with Check this text too see an Or paste your own text here. would minimise the possibility. kb -- WatchGuard Dimension instantly turns raw network data into actionable security intelligence. It gives you real-time visual feedback on key security issues and trends. Skip the complicated setup - simply import a virtual appliance and go from zero to informed in seconds. http://pubads.g.doubleclick.net/gampad/clk?id=123612991iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Token postag OR word
Can a token be a mixture of postags and words? Example: Can a token match send_end or of|into? If not, how do I indicate this? kb -- WatchGuard Dimension instantly turns raw network data into actionable security intelligence. It gives you real-time visual feedback on key security issues and trends. Skip the complicated setup - simply import a virtual appliance and go from zero to informed in seconds. http://pubads.g.doubleclick.net/gampad/clk?id=123612991iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Try this
rule id=CONDUCT_A name='Conduct' nominalization pattern token regexp=yesconduct.*/token token regexp=yesa|an|no/token token min=0 max=3/ token postag_regexp=yes postag=NN.*exception postag_regexp=yes postag=NNP.*/exceptionexception regexp=yesseries|school|operation/exception/token token regexp=yesof|into/token /pattern messageConsider revising this phrase to use the verb form of \4./message urlhttp://web.cn.edu/kwheeler/gram_nominalization.html/url shortNominalization/short example type=incorrectYou must markerconduct a review of/marker the data./example example type=correctYou must review the data./example /rule -- WatchGuard Dimension instantly turns raw network data into actionable security intelligence. It gives you real-time visual feedback on key security issues and trends. Skip the complicated setup - simply import a virtual appliance and go from zero to informed in seconds. http://pubads.g.doubleclick.net/gampad/clk?id=123612991iu=/4140/ostg.clktrk___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Token postag OR word
Thank, Jaume. I didn't know there's such to tag. Jaume Ortolà i Font wrote thus at 04:59 PM 28-01-14: 2014-01-28 Kumara Bhikkhu mailto:kumara.bhik...@gmail.comkumara.bhik...@gmail.com Can a token be a mixture of postags and words? Example: Can a token match send_end or of|into? If not, how do I indicate this? Yes, you can write this: or token postag=SENT_END / token regexp=yesof|into/token or It's equivalent to using two similar rules at the same time, one with sent_end and the other one with of|into. This is a recent addition to the syntax. I am not sure if the backreferences work always properly with the or-statement. Jaume -- WatchGuard Dimension instantly turns raw network data into actionable security intelligence. It gives you real-time visual feedback on key security issues and trends. Skip the complicated setup - simply import a virtual appliance and go from zero to informed in seconds. http://pubads.g.doubleclick.net/gampad/clk?id=123612991iu=/4140/ostg.clktrk___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Try this
Marcin MiÅkowski wrote thus at 08:06 PM 28-01-14: Try: token inflected=yesconduct/token instead. Thanks for the tip. I was wondering about it. I tried your rule using our rule editor: http://community.languagetool.org/ruleEditor/expert and we have lots of matches, but some of them are quite useless, for example: Mozart's Davide penitente (1785), his Piano Concerto KV 482 (1785), the Clarinet Quintet (1789) and the 40th Symphony (1788) had been premiered on the suggestion of Salieri, who supposedly conducted a performance of it in 1791. (wikipedia) It's relevant actually. It can be revised as who supposedly performed it in 1791. Unnecessary nominalisation is so common these days that most people put up with it to the point of ignoring it. Students often use it to make their writing sound more pompous, which is counterproductive. You can use the following queries on corpus.byu.edu corpora: conduct a|an|the|no [n*] of|into conduct a|an|the|no [n*] * of|into conduct a|an|the|no [n*] * * of|into conduct a|an|the|no [n*] * * * of|into Yes, I'm using that (minus the, which suggest that the noun refers to a specific thing mentioned earlier, thus the nominalisation is necessary). peace Kumâra Bhikkhu -- WatchGuard Dimension instantly turns raw network data into actionable security intelligence. It gives you real-time visual feedback on key security issues and trends. Skip the complicated setup - simply import a virtual appliance and go from zero to informed in seconds. http://pubads.g.doubleclick.net/gampad/clk?id=123612991iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: [bug] Ignore in LibreOffice
Kumara Bhikkhu wrote thus at 12:07 PM 23-01-14: Marcin MiÅkowski wrote thus at 05:22 PM 13-01-14: I wouldn't hold my breath, however. The development of LO/OO in the linguistic part seems to have come to a complete halt, and there are numerous simple bugs that are still not fixed after several years... Anyway, I'm trying: https://bugs.freedesktop.org/show_bug.cgi?id=73868 FYI, the bug is now assigned. That didn't take long. :-) kb -- CenturyLink Cloud: The Leader in Enterprise Cloud Services. Learn Why More Businesses Are Choosing CenturyLink Cloud For Critical Workloads, Development Environments Everything In Between. Get a Quote or Start a Free Trial Today. http://pubads.g.doubleclick.net/gampad/clk?id=119420431iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: [bug] Ignore in LibreOffice
Marcin MiÅkowski wrote thus at 05:22 PM 13-01-14: I wouldn't hold my breath, however. The development of LO/OO in the linguistic part seems to have come to a complete halt, and there are numerous simple bugs that are still not fixed after several years... Anyway, I'm trying: https://bugs.freedesktop.org/show_bug.cgi?id=73868 kb -- CenturyLink Cloud: The Leader in Enterprise Cloud Services. Learn Why More Businesses Are Choosing CenturyLink Cloud For Critical Workloads, Development Environments Everything In Between. Get a Quote or Start a Free Trial Today. http://pubads.g.doubleclick.net/gampad/clk?id=119420431iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: [bug] Ignore in LibreOffice
Daniel Naber wrote thus at 07:13 PM 09-01-14: On 2014-01-09 08:31, Kumara Bhikkhu wrote: When I click Ignore in the context menu, it turns off the rule for the whole document and even for other documents. That's because the option for the rule in LT is turned off. The expected behaviour is simply to unflag a particular instance. The text string Ignore comes from LO, so we cannot change that. What we could change is the behavior, e.g. to ignore the rule for now, but only until LO is restarted. We also cannot (easily) ignore only one instance of an error, because it's unclear how to recognize it. We'd need to look at the context, but then the question is: how much of the context. I recall that Word has a well thought out system for this. But anyway, can I confirm that this is not within the domain of LT? kb -- CenturyLink Cloud: The Leader in Enterprise Cloud Services. Learn Why More Businesses Are Choosing CenturyLink Cloud For Critical Workloads, Development Environments Everything In Between. Get a Quote or Start a Free Trial Today. http://pubads.g.doubleclick.net/gampad/clk?id=119420431iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: or is an adjective?
I realise that the pattern needs to be better defined. But lots of funny things come up as well. E.g., besides or, even road bullfrog are adjectives. And lots of nouns that are not within NN:U|NN:UN are flagged too. Surely people will complain about it, but these aren't caused of the rule pattern. Could it be worth investigating? kb Marcin MiÅkowski wrote thus at 06:00 PM 02-01-14: W dniu 2014-01-01 11:55, Kumara Bhikkhu pisze: Daniel Naber wrote thus at 06:07 PM 01-01-14: On 2013-12-31 07:18, Kumara Bhikkhu wrote: I found a strangely flagged string: or in heaven. It's by one of my test rules. Thanks, I have fixed that by adding a rule to disambiguation.xml. I have also removed the NN:U reading which doesn't seem useful for lowercase or. Not sure if using disambiguation makes sense, but this way we don't have to touch the binary dictionary for such a small change. Well, I could just add or as an exception. It's weird how or got into the adjective category. JFYI, this is a rule for the common tendency by academics to say big in size, red in color, few in number, etc. It's less pompous to just say big, red, or few. I think it's good enough now. Shall send it to you privately. The rule is right now creating only false alarms, and none of the matches is actually an error (some of them match because of the lack of disambiguation but most of them match correctly): https://languagetool.org/regression-tests/20140101/result_en_20140101.html If you want to match big in size, then it's best to use these tokens, and not a sequence of POS tags, as POS tags will match correct phrases. I will remove this rule because it's not useful at all in its current version, unfortunately, and it will only cause people to complain. Best, Marcin -- Rapidly troubleshoot problems before they affect your business. Most IT organizations don't have a clear picture of how application performance affects their revenue. With AppDynamics, you get 100% visibility into your Java,.NET, PHP application. Start your 15-day FREE TRIAL of AppDynamics Pro! http://pubads.g.doubleclick.net/gampad/clk?id=84349831iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: or is an adjective?
Daniel Naber wrote thus at 08:53 PM 02-01-14: On 2014-01-02 12:54, Marcin MiÅkowski wrote: Actually, road can be used as an adjective, for example in road sign. Other nouns like table, chair, and car are not in the dictionary as an adjective. Thanks for investigating, Daniel. Surely with car seat car functions as an adjective, but an adjectival noun is still a noun. If we take all adjectival nouns as adjectives as well, then kb -- Rapidly troubleshoot problems before they affect your business. Most IT organizations don't have a clear picture of how application performance affects their revenue. With AppDynamics, you get 100% visibility into your Java,.NET, PHP application. Start your 15-day FREE TRIAL of AppDynamics Pro! http://pubads.g.doubleclick.net/gampad/clk?id=84349831iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: or is an adjective?
Daniel Naber wrote thus at 06:07 PM 01-01-14: On 2013-12-31 07:18, Kumara Bhikkhu wrote: I found a strangely flagged string: or in heaven. It's by one of my test rules. Thanks, I have fixed that by adding a rule to disambiguation.xml. I have also removed the NN:U reading which doesn't seem useful for lowercase or. Not sure if using disambiguation makes sense, but this way we don't have to touch the binary dictionary for such a small change. Well, I could just add or as an exception. It's weird how or got into the adjective category. JFYI, this is a rule for the common tendency by academics to say big in size, red in color, few in number, etc. It's less pompous to just say big, red, or few. I think it's good enough now. Shall send it to you privately. kb -- Rapidly troubleshoot problems before they affect your business. Most IT organizations don't have a clear picture of how application performance affects their revenue. With AppDynamics, you get 100% visibility into your Java,.NET, PHP application. Start your 15-day FREE TRIAL of AppDynamics Pro! http://pubads.g.doubleclick.net/gampad/clk?id=84349831iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Tab vs Spaces in XML
I'm probably asking a novice question here. I see that the XMLs uses spaces instead of tabs. By using pretty print, a 707kb grammar.xml file is reduced to 590kb without any problem or lost of function. Doing this to other XML files would mean a significant reduction in weight for LT. So, why not? kb -- Rapidly troubleshoot problems before they affect your business. Most IT organizations don't have a clear picture of how application performance affects their revenue. With AppDynamics, you get 100% visibility into your Java,.NET, PHP application. Start your 15-day FREE TRIAL of AppDynamics Pro! http://pubads.g.doubleclick.net/gampad/clk?id=84349831iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
or is an adjective
Hi, everyone. I found a strangely flagged string: or in heaven. It's by one of my test rules. Apparently or is taken as adjective: pattern token postag_regexp=yes postag=JJ|JJR|JJSexception postag=VBN//token tokenin/token token postag_regexp=yes postag=NN:UN|NN:Uexception postag_regexp=yes postag=NNPS|NNP|VBG//token /pattern Or, am I missing something? kb -- Rapidly troubleshoot problems before they affect your business. Most IT organizations don't have a clear picture of how application performance affects their revenue. With AppDynamics, you get 100% visibility into your Java,.NET, PHP application. Start your 15-day FREE TRIAL of AppDynamics Pro! http://pubads.g.doubleclick.net/gampad/clk?id=84349831iu=/4140/ostg.clktrk___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: showing example sentences in GUI
OK just provide an option (with a click) in the GUI to show the examples. gulp21 wrote thus at 04:20 AM 04-12-13: we have example sentences for every error in grammar.xml, and we show them on community.lt.org, but not in the stand-alone GUI. Is there any reason to not show them in the GUI when the user selects More...? Not yet committed, but it might look like this (see attachment). Do you want to show all example sentences to the user? I don't think that this is a good idea (e.g., the German rule PRAEP_GEN[1] has 15 examples, and some like Auf verschiedenen markerWegen den/marker Markt erobern. do not help the user). In general, I like the idea, but I'd prefer if it was possible to mark two examples (showingui=yes or the like) which will be displayed to the user. Regards Markus -- Sponsored by Intel(R) XDK Develop, test and display web and hybrid apps with a single code base. Download it for free now! http://pubads.g.doubleclick.net/gampad/clk?id=111408631iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel -- Sponsored by Intel(R) XDK Develop, test and display web and hybrid apps with a single code base. Download it for free now! http://pubads.g.doubleclick.net/gampad/clk?id=111408631iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: help document for OO/LO installation
I think it's very well done. Daniel Naber wrote thus at 07:24 PM 21-11-13: Hi, as the installation of LT in LO can be a bit confusing, I have written a document that you can load to see if LT works for you: http://languagetool.org/issues/languagetool-test.odt Please let me know if you find problems with this or if you have ideas to improve it. It is linked on our common issues page at http://languagetool.org/issues/. Regards Daniel -- http://www.danielnaber.de -- Shape the Mobile Experience: Free Subscription Software experts and developers: Be at the forefront of tech innovation. Intel(R) Software Adrenaline delivers strategic insight and game-changing conversations that shape the rapidly evolving mobile landscape. Sign up now. http://pubads.g.doubleclick.net/gampad/clk?id=63431311iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel -- Shape the Mobile Experience: Free Subscription Software experts and developers: Be at the forefront of tech innovation. Intel(R) Software Adrenaline delivers strategic insight and game-changing conversations that shape the rapidly evolving mobile landscape. Sign up now. http://pubads.g.doubleclick.net/gampad/clk?id=63431311iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: renaming languagetool-standalone.jar?
Perhaps there's no need to rename after all. Especially when it is placed with LT for LibO/OOo and LT for FF, it should be obvious that standalone really means it stands alone and don't work with something else. But should you think a rename is necessary, gui may be okay, but that might confuse others. At least standalone is more understandable to most. kb Daniel Naber wrote thus at 08:23 PM 09-11-13: Hi, a user was confused about the name languagetool-standalone.jar[1]. Does anybody see a problem with renaming the file to languagetool-desktop.jar? languagetool-gui.jar might even be more appropriate, but I assume not everybody knows that that means. [1]https://github.com/languagetool-org/languagetool/issues/29 -- http://www.danielnaber.de -- November Webinars for C, C++, Fortran Developers Accelerate application performance with scalable programming models. Explore techniques for threading, error checking, porting, and tuning. Get the most from the latest Intel processors and coprocessors. See abstracts and register http://pubads.g.doubleclick.net/gampad/clk?id=60136231iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel -- November Webinars for C, C++, Fortran Developers Accelerate application performance with scalable programming models. Explore techniques for threading, error checking, porting, and tuning. Get the most from the latest Intel processors and coprocessors. See abstracts and register http://pubads.g.doubleclick.net/gampad/clk?id=60136231iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
sausage machine model - definition and examples of the sausage machine model in psycholinguistics
I wonder if this is of any use to the developers: http://grammar.about.com/od/rs/g/Sausage-Machine-Model.htm?nl=1 kb -- November Webinars for C, C++, Fortran Developers Accelerate application performance with scalable programming models. Explore techniques for threading, error checking, porting, and tuning. Get the most from the latest Intel processors and coprocessors. See abstracts and register http://pubads.g.doubleclick.net/gampad/clk?id=60136231iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: to + 3rd person singular
Marcin, your feedback prompted me to consider further, and I come up with a false positive: The country Im going to suffers from poverty. Strangely, such a construct doesn't occur in the test. (Digression: It would be more concise to say The country Im going to is poor. and this rightly wouldn't be flagged.) Would it be expecting to much that LT be capable of detecting noun clauses? kb Kumara Bhikkhu wrote thus at 11:37 AM 02-11-13: Oops! What was I thinking. The brain must have been a bit foggy then. I should have been token postag=VBZexceptionis/exceptionexception postag=NNS//token Anyway I get that it wouldn't catch to gets and to says. Thanks. kb Marcin MiÅkowski wrote thus at 12:01 AM 02-11-13: Hello, there are several problems with your rule: W dniu 2013-11-01 11:45, Kumara Bhikkhu pisze: Like to propose adding this to en\grammar.xml rule id=TO_VBZ name=to + 3rd person singular pattern token postag_regexp=yes postag=VB|VBD|VBG|VBN|VBP|VBZ/ token min=0/ tokento/token marker token postag=VBZexceptionis/exceptionexception postag=yesNNS/exception/token This exception won't match anything, as there is no postag in English that includes quotation marks. But if I remove quotation marks, the examples marked as incorrect below do not work anymore because 'says' is also NNS, just like 'gets': both words can be used as plural nouns in some contexts. I mean these parts: example correction=say type=incorrectWhat would it have to markersays/marker?/example example correction=get type=incorrectShe is manipulating her father to markerget/marker her way./example I added a restricted version of your rule to our grammar.xml but we still have no way to include your two examples as incorrect. Regards, Marcin -- Android is increasing in popularity, but the open development platform that developers love is also attractive to malware creators. Download this white paper to learn more about secure code signing practices that can help keep Android apps secure. http://pubads.g.doubleclick.net/gampad/clk?id=65839951iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel -- Android is increasing in popularity, but the open development platform that developers love is also attractive to malware creators. Download this white paper to learn more about secure code signing practices that can help keep Android apps secure. http://pubads.g.doubleclick.net/gampad/clk?id=65839951iu=/4140/ostg.clktrk___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Unexpected start position of marker.../marker
Marcin MiÅkowski wrote thus at 12:07 AM 30-10-13: W dniu 2013-10-29 13:27, Daniel Naber pisze: On 2013-10-29 09:36, Kumara Bhikkhu wrote: What does this mean? Unexpected start position of marker.../marker in incorrect example sentence: 10 but expected 11 There's some strange special case in our code that moves the match one character. It's probably (more or less) a bug, but you can work around it like this: example type=incorrect correction=, writtenThis book,marker which was written/marker in 1913, has been read by many people./example In other words, the marker must be moved one character to the left to include the space. Note that , may be introduced in a wrong place if you don't include the previous token, as by default, a white space is added before every suggestion. If you want to override this behavior, you need to specify the previous token. Right. Don't limit the marker. Should work. I'm opting for Daniel's workaround though as it looks more elegant from the user's pov. kb -- Android is increasing in popularity, but the open development platform that developers love is also attractive to malware creators. Download this white paper to learn more about secure code signing practices that can help keep Android apps secure. http://pubads.g.doubleclick.net/gampad/clk?id=65839951iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Unexpected start position of marker.../marker
Daniel Naber wrote thus at 04:35 PM 30-10-13: On 2013-10-30 08:00, Kumara Bhikkhu wrote: Thanks. Your workaround worked. I thought you got it wrong as these seemed contradicting: * correction=, written * ,marker which That correction is not related to the marker, it is what the suggestion element in the message says. Ooh Now this cleared up the matter. OK. No bug there, or here now. :-) kb -- Android is increasing in popularity, but the open development platform that developers love is also attractive to malware creators. Download this white paper to learn more about secure code signing practices that can help keep Android apps secure. http://pubads.g.doubleclick.net/gampad/clk?id=65839951iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: pattern of tokens web search
This is great! Thanks a lot. Daniel Naber wrote thus at 04:42 PM 30-10-13: Content-Transfer-Encoding: 7bit On 2013-10-30 03:20, Kumara Bhikkhu wrote: Is there any way in the Internet to search the web for a pattern, like VB important? Yes, COCA can do that, see http://wiki.languagetool.org/tips-and-tricks#toc1 and the attached screen shot. Regards Daniel -- http://www.danielnaber.de -- Android is increasing in popularity, but the open development platform that developers love is also attractive to malware creators. Download this white paper to learn more about secure code signing practices that can help keep Android apps secure. http://pubads.g.doubleclick.net/gampad/clk?id=65839951iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Unexpected start position of marker.../marker
What does this mean? Unexpected start position of marker.../marker in incorrect example sentence: 10 but expected 11 rule id=WHICH_IS name=Wordiness: which is pattern token,/token marker token regexp=yeswhich|who/token token regexp=yesis|are|was|were/token token/ /marker /pattern messageShortening this to just suggestion, \4/suggestion may also be correct and sound better./message shortWordiness/short example type=incorrect correction=writtenThis book, markerwhich was written/marker in 1913, has been read by many people./example example type=correctThis book, written in 1913, has been read by many people./example /rule -- Android is increasing in popularity, but the open development platform that developers love is also attractive to malware creators. Download this white paper to learn more about secure code signing practices that can help keep Android apps secure. http://pubads.g.doubleclick.net/gampad/clk?id=65839951iu=/4140/ostg.clktrk___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: few rules for Indonesian
Selamat datang! I believe the developers would be happy to include it. I know Malay, and would like to suggest concision in a few places: * You can leave out KESALAHAN (mistake/error) from rule id= * adalah can also be left out from the message in such a sentence construction (a lesson I learned from a grammar book) kb arnaud abbal wrote thus at 05:03 PM 29-10-13: Hi all, I discovered LT few weeks ago, looks very interesting. I'm currently working on adding words to Ammar Shadiq's Libre Office/Open Office spellchecking extension http://ammarshadiq.web.id/blog/2009/11/21/dictionary-dan-hyphenation-bahasa-indonesia-di-open-office-extension/http://ammarshadiq.web.id/blog/2009/11/21/dictionary-dan-hyphenation-bahasa-indonesia-di-open-office-extension/ which use hunspell's data. Here are some rules, hoping I'll write more in a near future, if you're willing to set up Indonesian as a supported language. Thanks for the great work you've done already. arno rule id=KESALAHAN_DIKTATUR_KEDIKTATORAN name=kesalahan diktatur/kediktatoran pattern tokendiktatur/token /pattern messageApakah maksud anda adalah suggestionkediktatoran/suggestion?/message example type=incorrectSebuah markerdiktatur/marker yang jahat/example example type=correctSebuah kediktatoran yang jahat/example /rule rule id=KESALAHAN_MEMFORMULASIKAN_MERUMUSKAN name=kesalahan memformulasikan/merumuskan pattern tokenmemformulasikan/token /pattern messageApakah maksud anda adalah suggestionmerumuskan/suggestion?/message example type=incorrectDia markermemformulasikan/marker sebuah teori yang menyeluruh/example example type=correctDia merumuskan sebuah teori yang menyeluruh/example /rule rule id=KESALAHAN_TERJELEK_PALING_JELEK name=kesalahan terjelek/paling jelek pattern tokenterjelek/token /pattern messageApakah maksud anda adalah suggestionpaling jelek/suggestion?/message example type=incorrectItu film yang markerterjelek/marker yang saya pernah menonton/example example type=correctItu film yang paling jelek yang saya pernah menonton/example /rule rule id=KESALAHAN_TERINTEGRASI_TERPADU name=kesalahan terintegrasi/terpadu pattern tokenterintegrasi/token /pattern messageApakah maksud anda adalah suggestionterpadu/suggestion?/message example type=incorrectSebuah sistem yang markerterintegrasi/marker/example example type=correctSebuah sistem yang terpadu/example /rule rule id=KESALAHAN_SEMUA_SELURUH name=kesalahan semua/seluruh pattern tokensemua/token /pattern messageApakah maksud anda adalah suggestionseluruh/suggestion?/message example type=incorrectMengendalikan markersemua/marker sistem/example example type=correctMengendalikan seluruh sistem/example /rule rule id=KESALAHAN_GDP_PDB name=kesalahan GDP/PDB pattern tokenGDP/token /pattern messageApakah maksud anda adalah suggestionPDB/suggestion?/message example type=incorrectPertumbuhan markerGDP/marker/example example type=correctPertumbuhan PDB/example /rule rule id=KESALAHAN_POLITISI_POLITIKUS name=kesalahan politisi/politikus pattern tokenpolitisi/token /pattern messageApakah maksud anda adalah suggestionpolitikus/suggestion?/message example type=incorrectPara markerpolitisi/marker/example example type=correctPara politikus/example /rule rule id=KESALAHAN_MEMODERNISASIKAN_MEMODERNISASI name=kesalahan memodernisasikan/memodernisasi pattern tokenmemodernisasikan/token /pattern messageApakah maksud anda adalah suggestionmemodernisasi/suggestion?/message example type=incorrectreformasi-reformasi yang bertujuan markermemodernisasikan/marker negara/example example type=correctreformasi-reformasi yang bertujuan memodernisasi negara/example /rule -- Android is increasing in popularity, but the open development platform that developers love is also attractive to malware creators. Download this white paper to learn more about secure code signing practices that can help keep Android apps secure. http://pubads.g.doubleclick.net/gampad/clk?id=65839951iu=/4140/ostg.clktrk___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
pattern of tokens web search
Dear all, Is there any way in the Internet to search the web for a pattern, like VB important? kb -- Android is increasing in popularity, but the open development platform that developers love is also attractive to malware creators. Download this white paper to learn more about secure code signing practices that can help keep Android apps secure. http://pubads.g.doubleclick.net/gampad/clk?id=65839951iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: planning LT homepage relaunch
From a highly regarded website consultant: * Top 10 Guidelines for Homepage Usability http://www.nngroup.com/articles/top-ten-guidelines-for-homepage-usability/ * The Ten Most Violated Homepage Design Guidelines http://www.nngroup.com/articles/most-violated-homepage-guidelines/ * 113 Design Guidelines for Homepage Usability http://www.nngroup.com/articles/113-design-guidelines-homepage-usability/ * Top 10 Mistakes in Web Design http://www.nngroup.com/articles/top-10-mistakes-web-design/ kb -- October Webinars: Code for Performance Free Intel webinars can help you accelerate application performance. Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from the latest Intel processors and coprocessors. See abstracts and register http://pubads.g.doubleclick.net/gampad/clk?id=60135991iu=/4140/ostg.clktrk___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
en bloc
They walked out of the hall en bloc. is flagged by rule id=THREE_NN name=Readability: Three nouns in a row Not sure what's the best way to fix this. Using exception would solve this one but problems would remain for other rules if LT still regards en bloc as two nouns. kb -- October Webinars: Code for Performance Free Intel webinars can help you accelerate application performance. Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from the latest Intel processors and coprocessors. See abstracts and register http://pubads.g.doubleclick.net/gampad/clk?id=60135991iu=/4140/ostg.clktrk___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
THREE_NN rule (Re: en bloc)
Dominique Pellé wrote thus at 02:54 PM 27-10-13: Some of them are not even 3 nouns in a row but contain verbs (clear false positives) such as in Government revenue comes mainly from [...]. But even when it flags 3 nouns, it's still annoying because it's not incorrect. I agree that it's better disabled by default. Besides that, how about adding exceptions for verbs? The rule is not meant to flag grammatical errors, but a possible readability issue. kb -- October Webinars: Code for Performance Free Intel webinars can help you accelerate application performance. Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from the latest Intel processors and coprocessors. See abstracts and register http://pubads.g.doubleclick.net/gampad/clk?id=60135991iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: en bloc
Sounds like a good idea. I don't know how to do that though. R.Baars wrote thus at 02:28 PM 27-10-13: That is french, Why not remove noun tags for this word group in the disambiguator? op 27-10-13 07:08, Kumara Bhikkhu schreef: They walked out of the hall en bloc. is flagged by rule id=THREE_NN name=Readability: Three nouns in a row Not sure what's the best way to fix this. Using exception would solve this one but problems would remain for other rules if LT still regards en bloc as two nouns. kb -- October Webinars: Code for Performance Free Intel webinars can help you accelerate application performance. Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from the latest Intel processors and coprocessors. See abstracts and register http://pubads.g.doubleclick.net/gampad/clk?id=60135991iu=/4140/ostg.clktrk___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Always late, is he?
I suggest either removing Always and Never from this rule, or add adjectives to the exception in the last token. rule pattern token postag=SENT_START/ !-- or but -- marker token regexp=yesAlways|Hardly|Never|Rarely|Seldom/token /marker tokenexception postag=VB.*|MD postag_regexp=yes//token /pattern messageThe adverb '\2' is usually not used at the beginning of a sentence./message urlhttp://faculty.washington.edu/marynell/grammar/AdverbPl.html/url example type=incorrectmarkerAlways/marker I am happy./example example type=correctI am markeralways/marker happy./example example type=correctmarkerAlways/marker wear your helmet./example /rule This is an example why testing with Wikipedia is inadequate. We need to test with non-formal styles of language. kb -- October Webinars: Code for Performance Free Intel webinars can help you accelerate application performance. Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from the latest Intel processors and coprocessors. See abstracts and register http://pubads.g.doubleclick.net/gampad/clk?id=60135031iu=/4140/ostg.clktrk___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: One of '{example}' is expected.
Daniel Naber wrote thus at 07:01 PM 16-10-13: On 2013-10-16 12:55, Kumara Bhikkhu wrote: size[size/NN:UN,B-NP-singular|E-NP-singular]. token postag=NN/ This needs to be postag=NN:UN or maybe postag=NN.* postag_regexp=yes Thanks. NN.* means all these, right? NNNoun, singular or mass: bicycle, earthquake, zipper NNS Noun, plural: bicycles, earthquake, zippers NN:UMass noun #new tag - deviation from Penn: admiration, air, Afrikaans NN:UNNoun used as mass #new tag - deviation from Penn: establishment, wax, afternoon NNP Proper noun, singular: Denver, DORAN, Alexandra NNPS Proper noun, plural: Buddhists, Englishmen kb -- October Webinars: Code for Performance Free Intel webinars can help you accelerate application performance. Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from the latest Intel processors and coprocessors. See abstracts and register http://pubads.g.doubleclick.net/gampad/clk?id=60135031iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: One of '{example}' is expected.
Daniel Naber wrote thus at 03:53 PM 16-10-13: On 2013-10-16 08:26, Kumara Bhikkhu wrote: Error: XML validation failed: org.xml.sax.SAXParseException; lineNumber: 9; columnNumber: 16; cvc-complex-type.2.4.a: Invalid content was found starting with element 'suggestion'. One of '{example}' is expected. This means that the short element must come directly after the suggestion element, otherwise the parser won't accept it. Then, shouldn't it say One of short is expected.? Better still, say something like what you said, something mere mortals like me can understand. kb -- October Webinars: Code for Performance Free Intel webinars can help you accelerate application performance. Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from the latest Intel processors and coprocessors. See abstracts and register http://pubads.g.doubleclick.net/gampad/clk?id=60135031iu=/4140/ostg.clktrk___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Taking Style Tips from Natural Language
This might help you write clearer code: http://jamesgolick.com/2008/1/13/taking-style-tips-from-natural-language.html -- October Webinars: Code for Performance Free Intel webinars can help you accelerate application performance. Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from the latest Intel processors and coprocessors. See abstracts and register http://pubads.g.doubleclick.net/gampad/clk?id=60135031iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
dictionary files
Not sure if this is of any use to any of you http://www.quinion.com/mqa/spell.htm British English dictionary (333K) US English dictionary (328K) French dictionary (552K) French-Canadian dictionary (552K) German dictionary (530K) Italian dictionary (239K) Dutch dictionary (787K) Norwegian dictionary (275K) Danish dictionary (149K) Afrikaans dictionary (745K) -- October Webinars: Code for Performance Free Intel webinars can help you accelerate application performance. Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from the latest Intel processors and coprocessors. See abstracts and register http://pubads.g.doubleclick.net/gampad/clk?id=60134071iu=/4140/ostg.clktrk___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: English chunking
Is everyone I talked to regarded as a noun phrase? Daniel Naber wrote thus at 06:11 PM 07-10-13: Hi, now that LT 2.3 is released, I'd like to point once more to our new chunking feature for English. It's supposed to detect chunks (also called phrases) so that rules can make use of them. For example, this text used to lead to a false alarm: Over 500 college and university chapters. That's because LT only considered 500 college and assumed that college needs to be in plural. With LT 2.3, it now knows that college and university chapters is one plural noun phrase and thus avoids the false alarm. This new feature is only of use if we actually use these chunks in rules, by adding new rules and rewriting some existing rules. This feature is documented at http://wiki.languagetool.org/using-chunks Regards Daniel -- October Webinars: Code for Performance Free Intel webinars can help you accelerate application performance. Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from the latest Intel processors and coprocessors. See abstracts and register http://pubads.g.doubleclick.net/gampad/clk?id=60134071iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Show tags
S I[I/PRP,B-NP-singular|E-NP-singular] love[love/VBP,B-VP] everyone[everyone/NN:U, everyone/PRP,B-NP-singular|E-NP-singular] I[I/PRP,B-NP-singular|E-NP-singular] talked[talk/VBD, talk/VBN,B-VP] to[to/IN, to/TO,B-NP-singular|E-NP-singular].[./., /S,O] Is it just me, or everyone else also finds this difficult to read. I'm thinking how can we make this easier to read. One tag analysis per line? kb -- October Webinars: Code for Performance Free Intel webinars can help you accelerate application performance. Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from the latest Intel processors and coprocessors. See abstracts and register http://pubads.g.doubleclick.net/gampad/clk?id=60134071iu=/4140/ostg.clktrk___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
LT flags 'students', suggests 'pupils'
Malaysia was colonised by the British, so British English is sort of the standard. But now American English has gain acceptance here too. Even as early as 20 years ago, when I was in university, our English lecturers allowed both variety of spellings. Some American spellings have even become more widely used, e.g., 'students'. So, I was a little bit surprised to find it flagged. I learnt that this is not found in grammar.xml. So, I can't just untick it in Options. The file that provides for this is replace.txt under en-GB. For my own purpose, I can remove the file to solve the problem. Less tech-savvy ones may find it annoying. Request: Provide option to not flag this category in Options. Move the 8 American British phrases rules now in grammar.xml into replace.txt. Also, I noticed some spelling errors in replace.txt: * trash=rubbis (rubbish) * trashcan=dustbin (trash can) kb -- October Webinars: Code for Performance Free Intel webinars can help you accelerate application performance. Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from the latest Intel processors and coprocessors. See abstracts and register http://pubads.g.doubleclick.net/gampad/clk?id=60134071iu=/4140/ostg.clktrk___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: [LT] Important: Need to specify installing 32-bit JRE7
Daniel Naber wrote thus at 09:02 PM 06-10-13: On 2013-10-05 12:36, Kumara Bhikkhu wrote: Without JRE7 32-bit, this is what LibO (on Windows Home Premium 64-bit) shows: I have improved our website a bit to mention that. I suggest you open a bug report at OO/LO asking for a better wording of that dialog. The text should probably mention the 32/64-bit issue. Someone has done that long time ago, but nothing was done. So, I've added some comments on how bad it appeared for me. kb -- October Webinars: Code for Performance Free Intel webinars can help you accelerate application performance. Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from the latest Intel processors and coprocessors. See abstracts and register http://pubads.g.doubleclick.net/gampad/clk?id=60134791iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: zero or more tokens
Done that. You have She wants you to goes there. I've been trying to called you. She is manipulating her father to gets her way. He tried very hard to lifted the rock. The rule I created flags the first and the third. It wrongly flags only the last sentence here: The calendar her eyes kept straying to said it was December sixth. But everyone I talked to said it was too risky. Whatever college he wants to go to is fine with me. So, consider integrating the 2. kb =?ISO-8859-2?Q?Marcin_Mi=B3kowski?= wrote thus at 05:58 PM 07-10-13: Read the issue description on github. I added possible false alarms there. Regards Marcin -- October Webinars: Code for Performance Free Intel webinars can help you accelerate application performance. Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from the latest Intel processors and coprocessors. See abstracts and register http://pubads.g.doubleclick.net/gampad/clk?id=60134791iu=/4140/ostg.clktrk___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
zero or more tokens
How do I indicate zero or more token? This doesn't work: token min=0/ kb -- October Webinars: Code for Performance Free Intel webinars can help you accelerate application performance. Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from the latest Intel processors and coprocessors. See abstracts and register http://pubads.g.doubleclick.net/gampad/clk?id=60134791iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Modules for individual supported languages?
Dominique Pellé wrote thus at 02:21 PM 05-10-13: Content-Transfer-Encoding: base64Jan Schreiber wrote: The LT-2.3 download standalone file is currently 67Mb (45 Mb for LibreOffice) with all languages. Is it such a big deal? I prefer to have all languages and keep it simple. Benefits of simplicity tend to be underestimated. Please allow me to offer my opinion. As I understand the current situation, I agree that LT remains to be for all languages. At the same time I also understand the demand for single language versions. Wanting a lighter program is one. The other is the perception that If it's meant for dozens of languages, it probably doesn't do one well. I'm speaking for myself as that's the idea I had when I was looking for a grammar checker extension. I went for After the Deadline first for this reason. Installing plugins over a plugin may be fine for us, but may confuse some users. It also demands more work, which I think is not worth doing, at least for the time being. I suggest replying with appreciation for the suggestion and that the developers cannot afford to do that at the moment. kb -- October Webinars: Code for Performance Free Intel webinars can help you accelerate application performance. Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from the latest Intel processors and coprocessors. See abstracts and register http://pubads.g.doubleclick.net/gampad/clk?id=60134791iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
[LT] Important: Need to specify installing 32-bit JRE7
For LibO to work with LT3.2, the user needs to install the 32-bit JRE7. User may install the 62-bit version and meet an error message, because LibO is 32-bit. Please specify at download area of homepage and at http://languagetool.org/issues/. kb -- October Webinars: Code for Performance Free Intel webinars can help you accelerate application performance. Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from the latest Intel processors and coprocessors. See abstracts and register http://pubads.g.doubleclick.net/gampad/clk?id=60134791iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: zero or more tokens
Daniel Naber wrote thus at 04:31 PM 05-10-13: On 2013-10-05 09:21, Kumara Bhikkhu wrote: How do I indicate zero or more token? This doesn't work: token min=0/ There's still a bug in our code, only zero or one currently works. Your example should match zero or one token (max=1 is the default). If it doesn't, please post a complete example. OK. Promise you won't laugh. :-) I don't have a good head for details. That's why I need a good grammar checker. I'm trying to improve upon the to infinitive rule with this: rule id=TO_NON_BASE name='to' + non-base form pattern token min=1 postag_regexp=yes postag=VBD|VBG|VBN|VBP|VBZ/ token min=0/ tokento/token marker token postag=VBD|VBG|VBN|VBZ/token /marker /pattern messageYou might need to use the base form of the verb here: suggestionmatch no=4 postag=VB/match/suggestion./message shortTo + infinitive/short example type=correctI was surprised to markerlearn/marker this./example example type=correctHe spoke to chosen people./example example correction=learn type=incorrectI was surprised to markerlearns/marker this./example /rule -- October Webinars: Code for Performance Free Intel webinars can help you accelerate application performance. Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from the latest Intel processors and coprocessors. See abstracts and register http://pubads.g.doubleclick.net/gampad/clk?id=60134791iu=/4140/ostg.clktrk___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: He tried not to laughs.
I think I've a better idea now. Thanks. Ruud Baars wrote thus at 02:23 PM 04-10-13: Most words have multiple possible postags. To dtermine which one is the actual one in this sentence, the disabiguator looks at the words around it, and tries to make a decision. The code for disambiguation is in disambiguation.xml in the resource directory. It is very much like the LT rules. Ruud -- October Webinars: Code for Performance Free Intel webinars can help you accelerate application performance. Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from the latest Intel processors and coprocessors. See abstracts and register http://pubads.g.doubleclick.net/gampad/clk?id=60134791iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: [LT] Important: Need to specify installing 32-bit JRE7
As I've just learnt, currently LibreOffice 64-bit is available for Linux, but not for Mac and Windows. So, it's good to highlight the need for 32-bit JRE7 for Mac and Windows users. Xavi Ivars wrote thus at 06:47 PM 05-10-13: Java has to be on the same architecture than LibreOffice. So if you have 32bits LibreOffice, Java needs to be also 32 bits. But if you have 64bits LibreOffice, you need 64bits Java. That's why it's not possible to use LT 2.3 in Mac right now: there are only 32bits builds of LibO for OS X while there are only 64bits builds of Java. -- October Webinars: Code for Performance Free Intel webinars can help you accelerate application performance. Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from the latest Intel processors and coprocessors. See abstracts and register http://pubads.g.doubleclick.net/gampad/clk?id=60134791iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: Facebook stats
Might be just a coincidence. Anyway, recently, I've been planting baits on QA and bugs forums of OOo and LibO, plus other places. Hope to attract coders that way too. Jan Schreiber wrote thus at 04:40 PM 04-10-13: Hello everybody, FYI, here are a few screenshots that show recent statistics about our Facebook page. User activity there is increasing slowly but steadily. In the last few days, two users have contacted us via our Facebook wall. --Jan -- October Webinars: Code for Performance Free Intel webinars can help you accelerate application performance. Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from the latest Intel processors and coprocessors. See abstracts and register http://pubads.g.doubleclick.net/gampad/clk?id=60134791iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel
Re: He tried not to laughs.
By verb I mean of course the wrong verb (i.e., non-base form). Anyway, mails you from always end up as below: all joined up. Marcin MiÅkowski wrote thus at 04:04 PM 04-10-13: W dniu 2013-10-04 05:00, Kumara Bhikkhu pisze: Disambiguator. I don't even know what that is. Never mind. Thanks. I'll add rules for not to (verb) and (verb) to (verb), and see how that goes. What do you want to achieve this way? These will not be error patterns... Best, Marcin kb Marcin Mià kowski wrote thus at 06:39 PM 03-10-13: W dniu 2013-10-03 11:43, Kumara Bhikkhu pisze: Marcin Mià kowski wrote thus at 04 04:15 PM 03-10-13: Hi, W dniu 2013-10-03 06:24, Kumara Bhikkhu pisze: Can the one who created this contact me personally? It's not triggering He tried not to laugh_s_. I don't know how to correct it. I'll write this on the list -- laughs is also plural of laugh, which is excluded by the exception below (NNS). Unfortunately, without this exception, a lot of false alarms are found.I thought so.Now, maybe we could have a second variant of the rule that takes not +to, and then the exception would not be required. This would have to be tested on a large corpus. Anyway of indicating verbs but excepting those what are also nouns? No, it's not possible unless you have a perfect rule in the disambiguator for this. Best, Marcin -- October Webinars: Code for Performance Free Intel webinars can help you accelerate application performance. Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from the latest Intel processors and coprocessors. See abstracts and register http://pubads.g.doubleclick.net/gampad/clk?id=60134791iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel -- October Webinars: Code for Performance Free Intel webinars can help you accelerate application performance. Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from the latest Intel processors and coprocessors. See abstracts and register http://pubads.g.doubleclick.net/gampad/clk?id=60134791iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel . -- October Webinars: Code for Performance Free Intel webinars can help you accelerate application performance. Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from the latest Intel processors and coprocessors. See abstracts and register http://pubads.g.doubleclick.net/gampad/clk?id=60134791iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel -- October Webinars: Code for Performance Free Intel webinars can help you accelerate application performance. Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from the latest Intel processors and coprocessors. See abstracts and register http://pubads.g.doubleclick.net/gampad/clk?id=60134791iu=/4140/ostg.clktrk ___ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel