Re: LanguageTool for Chrome

2015-10-27 Thread Daniel Naber
On 2015-10-23 18:52, Jaume Ortolà i Font wrote: > The two first issues are solved. The last one not: composing a message > in gmail. In plain text mode, the line breaks disappear. In html mode, > all formatting disappears. There are still cases where replacement will not work, but they should

Re: LanguageTool for Chrome

2015-10-27 Thread Daniel Naber
On 2015-10-27 18:29, Xavi Ivars wrote: > I just did som esmall tests in Gmail, and it still broke the layout, > removing all the line breaks. I tested it in gmail and it worked for me. Are you sure you're using the updated version (0.8.2)? Can you send a screen shot of the input text before

Re: LanguageTool speed measurements, from LT-1.8 till LT-3.2-SNAPSHOT

2015-10-25 Thread Daniel Naber
On 2015-10-25 05:13, Dominique Pellé wrote: Hi Dominique, > I measured LT speed using command line version of LanguageTool. > Recorded numbers are user time reported by Linux time command. thanks, that's interesting. I think one issue would need to be fixed before this can be automated: Maybe

Re: LanguageTool for Chrome

2015-10-25 Thread Daniel Naber
On 2015-10-25 14:44, Juan Martorell wrote: > I guess so. However, since I was not directly involved in Hunspell > integration into LT I want to make sure that the upgrade introduces no > regression. Besides, the extension of the dictionary in LO and in LT > do not match -- I think that needs a

Re: LanguageTool for Chrome

2015-10-24 Thread Daniel Naber
On 2015-10-23 18:52, Jaume Ortolà i Font wrote: > The two first issues are solved. The last one not: composing a message > in gmail. In plain text mode, the line breaks disappear. In html mode, > all formatting disappears. I understand the problem now, but it will take some time to fix. Regards

Re: LanguageTool for Chrome

2015-10-24 Thread Daniel Naber
On 2015-10-24 16:52, Juan Martorell wrote: > Latest dictionary in LO5 comes with version 0.8. However in LT source > code repository I see the version 0.2. So can't we just update the version in LT to improve support a lot, with little work for us? > Extending the Spanish dictionary is an

Re: LanguageTool for Chrome

2015-10-23 Thread Daniel Naber
On 2015-10-22 11:33, Juan Martorell wrote: > Plus, the extension is not configurable, so you're forced to the > default behaviour, which for Spanish is not optimal given the amount > of false spell check errors because of the lack of dictionary words -- > and you cannot add them to your local

Re: LanguageTool for Chrome

2015-10-23 Thread Daniel Naber
On 2015-10-22 11:21, Jaume Ortolà i Font wrote: Hi Jaume, > I have tested it and I found some strange behavior when trying to > replace errors with suggestions. thanks for your feedback, a new version (0.8.1) has just been released, could you test if it fixes the issues? Everybody else,

Re: LanguageTool for Chrome

2015-10-22 Thread Daniel Naber
On 2015-10-19 23:57, Daniel Naber wrote: > there's now an early version of a LT extension for Chrome available: > > https://chrome.google.com/webstore/detail/languagetool/oldceeleldhonbafppcapldpdifcinji?hl=de=DE As there was no feedback, I'm not sure if nobody has tested this or if

Re: using ngram data to detect errors

2015-10-19 Thread Daniel Naber
Hi, this is just a reminder that the data for statistical error detection exists, now people just need to use it... Regards Daniel On 2015-09-16 22:50, Daniel Naber wrote: > Hi, > > some time ago, I've added a rule for English to detect errors > statistically, by using large ngr

LanguageTool for Chrome

2015-10-19 Thread Daniel Naber
Hi, there's now an early version of a LT extension for Chrome available: https://chrome.google.com/webstore/detail/languagetool/oldceeleldhonbafppcapldpdifcinji?hl=de=DE Please give it a try and report bugs and ideas. The extension cannot be found yet by searching but only via the link above.

Re: Behavior of non-breaking space U+00A0 in LanguageTool

2015-10-16 Thread Daniel Naber
On 2015-10-12 15:31, Dominique Pellé wrote: > In French at least, handling "U+202F NARROW NO-BREAK SPACE" > correctly (i.e. as a space for LT) would be useful. This should be implemented now. Regards Daniel --

Re: nightly regressions

2015-10-14 Thread Daniel Naber
On 2015-10-14 06:37, Dominique Pellé wrote: > I can see that sometimes, it's OK to treat nbsp as a whitespace, > but here, it's not since the goal of the rule is precisely to check > for > nbsp (either U+00A0 or U+202F). I'm not sure how to fix it. Maybe > we need an API to return the kind of

Re: new syntax available

2015-10-14 Thread Daniel Naber
On 2015-10-14 14:01, Dominique Pellé wrote: > Thanks for the (?:\s+) change! > How about... (?:[\sxA0]+) instead? Done (written as \u00A0 in the regex). > Being able to highlight part the regexp would be useful with > ...(...) Most of the places > where I'm thinking of using would need

Re: new syntax available

2015-10-14 Thread Daniel Naber
On 2015-10-11 12:31, Daniel Naber wrote: >> Use of "exact-meaning" would be very rare. >> Maybe a better name: > > I think that's okay with me, but I need to think more about it. Maybe > the easiest implementation would be to just replace " &quo

nightly regressions

2015-10-13 Thread Daniel Naber
Hi, we have quite some changes in the nightly tests today. I'm not sure what the cause is, could you check your language and see if the changes are good or bad? https://languagetool.org/regression-tests/20151013/ Regards Daniel

Re: Behavior of non-breaking space U+00A0 in LanguageTool

2015-10-12 Thread Daniel Naber
On 2015-10-11 19:18, Dominique Pellé wrote: > I think that spaces or non-breaking spaces should behave > the same for LanguageTool. I guess so. I've made a commit that changes this. It broke some tests: I fixed fr/grammar.xml and commented out tests in QuestionWhitespaceRuleTest. Could you

Re: new syntax available

2015-10-11 Thread Daniel Naber
On 2015-10-10 06:16, Dominique Pellé wrote: > I'm not sure I understand how it would work for users. My idea was that it would work automatically. But you're right that users might also paste text with lines breaks, and my idea of having a parsing or normalization (when reading the input)

Re: Several in same rule?

2015-10-11 Thread Daniel Naber
On 2015-10-11 11:58, Dominique Pellé wrote: > Would be possible to allow for several tags > in the same rule? I don't think it's very difficult. I could put it on my TODO list, but I cannot make any promises about when I have time for this. Regards Daniel

Re: new syntax available

2015-10-09 Thread Daniel Naber
On 2015-10-09 07:32, Dominique Pellé wrote: > I suppose that I care more than most because I only use LT to check > text files where the situation is frequent. I think normalizing the text makes sense if: 1) single line breaks get removed from plain text files (but not double spaces) 2) this

Re: new syntax available

2015-10-08 Thread Daniel Naber
On 2015-10-08 06:59, Dominique Pellé wrote: > ... then the regexp rule does not detect all the errors > that the rule detected. It does not detect errors > in "foo bar" (2 spaces or more, or tabs) or when there is a > new line as in: > > foo > bar > > How to fix it? I don't think it

Re: new syntax available

2015-10-07 Thread Daniel Naber
On 2015-10-07 06:41, Dominique Pellé wrote: Hi Dominique, thanks for your feedback. > 1) How do I highlight only a subset of the match? Trying the above > rule, I see this: That's not yet possible, but I like the idea of a 'marker' attribute. I'll add that to my TODO list. > 2) Is there

Re: Invalid XML in Dutch grammar.xml

2015-10-06 Thread Daniel Naber
On 2015-10-06 03:08, Dominique Pellé wrote: > Het ligt door de > war. > > I also wonder why LT accepts the XML file without giving errors. Technically, that XML is valid. While < needs to be escaped, > doesn't. So the value of the correction attribute is: "in> de war" Regards Daniel

Re: new syntax available

2015-10-06 Thread Daniel Naber
On 2015-10-06 22:02, Jaume Ortolà i Font wrote: > Thanks, Daniel. It is very useful. > > Do you suggest converting all simple rules to this new syntax? Do you > expect some improvement in performance? I leave that up to the maintainer of each language. The reason for this feature is that some

new syntax available

2015-10-05 Thread Daniel Naber
Hi, there's now a first and limited implementation of the syntax in master. Instead of foo you can now use foo But be aware that this is a real regular expression that ignores tokens, so it matches anything with the substring 'foo'. Also, the regular expression is case-insensitive by

ignore tags from binary dictionary

2015-09-29 Thread Daniel Naber
Hi, English and German now have a file "removed.txt" to ignore entries from our binary POS tagger dictionaries. This way incorrect items can be ignored without rebuilding the binary dictionary. If you need this for your language, add this code to its tagger (with the language code instead of

Re: Idea to introduce tag in LT grammar rules.

2015-09-29 Thread Daniel Naber
On 2015-09-28 20:36, Dominique Pellé wrote: >>> a (plein temps|chaque fois|rude épreuve|vol >>> d’oiseau) >>> >>> It would make some such rules a lot simpler to write and more >>> concise. >> >> Dominique, are you planning to implement this? > I have very little time at home, so it's unlikely

ANN: LanguageTool 3.1

2015-09-28 Thread Daniel Naber
I'm pleased to announce the release of LanguageTool 3.1. As usual, it is available for download at https://www.languagetool.org. Changes include: * Many updates for the error detection rules for English, French, German, Japanese, Polish, Portuguese, Russian, and Ukrainian * Bug fixes * Java

towards grammar checking in Firefox

2015-09-27 Thread Daniel Naber
Hi, there's now a patch available that allows the development of grammar checkers in Firefox as add-ons. In other words, a Javascript-based add-on can underline errors in textareas. See the attached screenshot, where the add-on sends the text to the languagetool.org server and then

Re: [jna.Native] - Could not initialize class com.sun.jna.Native

2015-09-24 Thread Daniel Naber
On 2015-09-24 14:50, Remy Leone wrote: > $ java -cp languagetool-server.jar org.languagetool.server.HTTPServer > --port 8081 > WARNING: running in HTTP mode, consider using SSL by running > org.languagetool.server.HTTPSServer instead > Setting up thread pool with 10 threads > Starting

Re: languagetool.org on mobile

2015-09-21 Thread Daniel Naber
On 2015-09-11 22:11, Dominique Pellé wrote: > It's not just on mobile that the text area disappears. > Even on desktop it disappears if the browser window > is resized small enough. This has been changed now, both mobile and narrow desktop browsers should always display the textarea. Everybody

Re: reminder: feature freeze for 3.1

2015-09-21 Thread Daniel Naber
On 2015-09-20 20:35, Andre Couture wrote: > Just wanted to check if you were able to merge the xml rules changes > that Doug did? > The short messages mainly. I agree that our elements are not very consistent, but before I merge your changes here are a few remarks: * Are these distinctions

reminder: feature freeze for 3.1

2015-09-20 Thread Daniel Naber
Hi, this is a reminder that we're now in feature freeze for LT 3.1. Please add missing translations at Transifex and don't introduce new features or English strings that need translation. The release is planned for 2015-09-28. http://wiki.languagetool.org/roadmap Regards Daniel

MS Word macro

2015-09-18 Thread Daniel Naber
Hi, someone has written a simple macro to use languagetool.org from MS Word. The code and his message are mostly German, but maybe it's useful to somebody: http://www.office-loesung.de/p/viewtopic.php?f=162=702408 Regards Daniel

using ngram data to detect errors

2015-09-16 Thread Daniel Naber
Hi, some time ago, I've added a rule for English to detect errors statistically, by using large ngram data sets. I've activated the rule now for all languages that we have data for: Chinese, French, Italian, Russian, and Spanish (German had been activated for some time already). That means

reminder: upcoming feature freeze for 3.1

2015-09-15 Thread Daniel Naber
Hi, just a friendly reminder: the release of LT 3.1 is planned for 2015-09-28, feature freeze starts 2015-09-20. Regards Daniel -- ___ Languagetool-devel mailing list

Re: Examples needed (several rules)

2015-09-11 Thread Daniel Naber
On 2015-09-11 11:13, bloc...@web.de wrote: Hi Thomas, > I'd like to extend the tool by several rules I found here: > http://www.andreaseschbach.de/schreiben/10punkte/10punkte.html > > Can someone provide an example for the following case: > 2. if the word ist in a list of words, strikeout the

languagetool.org on mobile

2015-09-11 Thread Daniel Naber
Hi, on mobile, languagetool.org doesn't show the textarea where you can try LT. Originally, this was on purpose, but nowadays smartphone displays have good resolution and I think we should show it at least for modern devices. The problem is I cannot really make the media query ("@media

Re: marketing LT in US/UK

2015-09-10 Thread Daniel Naber
On 2015-09-10 11:00, Richard Eckart de Castilho wrote: > does the map show visitors or users (e.g. people interacting sensibly > with the forms on the website)? > > A substantial part of requests from some of the countries, e.g. > Russia, might be due to bots. The map comes from piwik which

marketing LT in US/UK

2015-09-09 Thread Daniel Naber
Hi, the attached image shows where languagetool.org visitors are coming from: the darker the blue, the more visitors. As you can see, relatively few users are from US/UK, even though English is one of the languages that we support for a long time and whose support should be quite good.

Re: Idea to introduce tag in LT grammar rules.

2015-09-07 Thread Daniel Naber
On 2015-09-07 10:55, Dominique Pellé wrote: > If cannot be combined with then we lose the ability to > have > inflected="yes", postag="...", etc. > > But I wonder how expensive this is. I assume that tokenization makes > matching > faster than applying regexp to whole sentences. If it slows

Re: Idea to introduce tag in LT grammar rules.

2015-09-06 Thread Daniel Naber
On 2015-09-05 22:53, Dominique Pellé wrote: > It is similar to what Daniel wrote earlier as well: > > a (plein temps|chaque fois|rude épreuve|vol > d’oiseau) So instead of ... we would have ..., but it couldn't be combined with , is that right? We'll need to decide if a plain temps implies

Re: Idea to introduce tag in LT grammar rules.

2015-09-05 Thread Daniel Naber
On 2015-09-04 23:21, Dominique Pellé wrote: > I wish I could write a rule pattern like this: > > plein temps#chaque fois#rude épreuve#vol > d’oiseau What about a more radical approach (which would be trickier to implement): a plein temps|chaque fois|rude épreuve|vol d’oiseau Then you

Re: build error with mvn from freshly cloned git repository

2015-08-29 Thread Daniel Naber
On 2015-08-29 00:26, Dominique Pellé wrote: (default-compile) on project languagetool-dev: Fatal error compiling: invalid target release: 1.8 - [Help 1] You need Java 8 to compile LT now, as the dev module has been moved to Java 8 already. Regards Daniel

Re: Compound words for en_GB + pt_PT

2015-08-29 Thread Daniel Naber
On 2015-08-28 10:50, Marco A.G.Pinto wrote: I have used Proofing Tool GUI to extract all the compounds from the British and Portuguese dictionaries: - en_GB - 5390 compounds https://dl.dropboxusercontent.com/u/30674540/en_gb_compound_words_5390words_20150828.txt I think a native speaker

Re: Subjects of post to this mailinglist

2015-08-29 Thread Daniel Naber
On 2015-08-29 12:02, Purodha Blissenbach wrote: is it possible to have a string like [language tool] or maybe something shorter prefixed to the subjects of mailing list posts? Many lists have that, and at least to me it is really helpful in quickly understanding what is what in my 100+ daily

changes in Firefox add-ons

2015-08-23 Thread Daniel Naber
Hi, Mozilla has announced changes in the way Firefox add-ons will work in the future[1]. In the long term ( 1 year), this will also affect LanguageToolFx, our add-on for Firefox. We won't be able anymore to access the user's personal dictionary, so spell checking won't work very well as our

ngram rule for German

2015-08-20 Thread Daniel Naber
Hi, on languagetool.org, a word confusion rule for German has just been activated. It works just like the confusion rule for English: it uses a large set of ngram occurrence data to decide which variant is (probably) correct. Here are two examples of errors now detected thanks to this rule:

Re: Debugging NullPointerException in tests

2015-08-19 Thread Daniel Naber
On 2015-08-08 15:22, Ionuţ Păduraru wrote: It only happens when I use JAVA8 (both OpenJDK and Oracle). It does NOT reproduce with Java7. Java 1.8.0_60 has been released yesterday. So far I could not reproduce the issue with it. Can you confirm that _60 might fix it? Regards Daniel

updated ngram data

2015-08-17 Thread Daniel Naber
Hi, I've updated our ngram data that is used to power our English homophone confusion rule. This is the rule that finds many cases where words like there/their, breathe/breath etc are confused. The new ngram data is based on the Google ngram data from 2012 and it's much larger than the

Re: updated ngram data

2015-08-17 Thread Daniel Naber
On 2015-08-17 17:57, Andre Couture wrote: (default task-2) Format version is not supported (resource MMapIndexInput(path=/google-ngram-index/3grams/_1e4_Lucene50_0.tim)): 1 (needs to be between 0 and 0) (default task-2) RuleMatch error unhandled, type =Typographical Do I simply need to

Re: Terminology checking

2015-08-14 Thread Daniel Naber
On 2015-08-13 18:56, Ashish Agrawal wrote: I would like to integrate terminology checking (through a terminology management system) into LanguageTools. E.g. Checking for brand names, specific terms in medical terminology, etc.. If your terminology management system can give you the terms,

Re: IntelliJ IDEA Ultimate Edition for LanguageTool developers

2015-08-13 Thread Daniel Naber
On 2015-08-13 17:01, Juan Martorell wrote: They also kindly ask us mentioning their support: I've added the IntelliJ logo to https://languagetool.org/development/ Regards Daniel --

Re: web spell checking

2015-08-12 Thread Daniel Naber
On 2015-08-13 02:00, Andrew Pullins wrote: Thank you. you know I saw that on the home page and for what ever reason did not connect that it was doing what I want it to do.. can it be set to auto check? No, at least not without extending the Javascript code. Regards Daniel

Re: web spell checking

2015-08-10 Thread Daniel Naber
On 2015-08-10 03:45, Andrew Pullins wrote: Hi Andrew, 366 of the Linux Action Show here [1]. I was wondering if I would be able to take Languagetool and add it to a website that I am developing. thanks for your interest in LT. You can integrate LT like we do on languagetool.org:

Re: Debugging NullPointerException in tests

2015-08-09 Thread Daniel Naber
On 2015-08-08 15:22, Ionuţ Păduraru wrote: Or course, none of these workarounds makes any sense (but they do fix/workaround the issue)... To me it looks like a JVM bug. I'm using Oracle Java 1.8.0_51, which version are you using? How often do you need to run the test to reproduce the

Debugging NullPointerException in tests

2015-08-07 Thread Daniel Naber
Sometimes the Jenkins tests fail in PatternRuleMatcher line 300, usually when running the Catalan pattern tests: if (j = positions.size()) { j is an int, positions is ListInteger, both are local variables. How can this cause a NullPointerException? I see no place in the code where

Re: probability theory code review?

2015-08-06 Thread Daniel Naber
On 2015-08-06 11:31, Dawid Weiss wrote: There are other issues where I could need help from an expert. For example, results don't get better when we use 4grams instead of 3grams. This is, I think, a general conclusion from using shingles of any data -- if you're increasing their lengths

probability theory code review?

2015-08-06 Thread Daniel Naber
Hi, we're using a bit probability theory to calculate ngram probabilities. This way we can decide which word of a homophone pair like there/their is (probably) correct. Is anybody here familiar with probability theory and could review that code? The main part is here:

Re: LanguageTool demo

2015-08-06 Thread Daniel Naber
On 2015-08-06 09:55, Andriy Rysin wrote: I am going to do a presentation on LanguageTool and its Ukrainian module here in Kyiv next week and wondering if you can suggest some materials/slides for LT architecture so I don't create them from scratch. I have only 1 hour so high level overview is

Re: Show the POS of a token in the message

2015-07-31 Thread Daniel Naber
On 2015-07-31 14:41, Mike Unwalla wrote: I want to show the POS of a token in a message, but I cannot see how to do it. Is it possible? How? I don't think so. I think that's more a question of debugging and we have separate tools for that, e.g. using the -v option for the command line

anti patterns for Java rules

2015-07-30 Thread Daniel Naber
Hi, the Rule class now has a method getAntiPatterns() that sub classes can overwrite to ignore certain phrases, in order to avoid false alarms. To make use of that, your rule needs to call getSentenceWithImmunization() in its match() method and also test the token readings for isImmunized()

Re: Integration of LanguageTool with VisualEditor

2015-07-29 Thread Daniel Naber
On 2015-07-29 09:49, Ankita Kumari wrote: Meanwhile, I know this question is very vague, but is there something that could disable only spelling errors. I did a recheck just now and *only* spelling errors are being overlooked. If you use language code en instead of en-US, en-GB etc. you will

Re: Integration of LanguageTool with VisualEditor

2015-07-28 Thread Daniel Naber
On 2015-07-28 12:51, Ankita Kumari wrote: I have the server hosted here : http://tools.wmflabs.org/languageproofing/ [1] I get Internal sever error, this is something that cannot be debugged from outside. I suggest you ask the Tool Labs people about which log file the error might appear in.

Re: Integration of LanguageTool with VisualEditor

2015-07-28 Thread Daniel Naber
On 2015-07-28 19:59, Ankita Kumari wrote: I am sorry for not making this clear earlier. You need to visit this page : http://tools.wmflabs.org/languageproofing-ui/ [1] That page works for me, i.e. the error is detected and marked with a yellow background. But I don't see anything specific

Re: Re[2]: Fwd: Re[2]: Fwd: Re: pt_PT - spelling.txt

2015-07-28 Thread Daniel Naber
On 2015-07-28 21:44, Yakov Reztsov wrote: Step to reproduce: 1. Computer: Linux 32-bit, Java 7 2. LanguageTool-20150726-snapshot.zip 3. java -jar languagetool.jar 4. Language is already set to English or Russian. 5. User select language to Portuguese. 6. A window appears with an error. I

Re: Integration of LanguageTool with VisualEditor

2015-07-28 Thread Daniel Naber
On 2015-07-28 20:56, Ankita Kumari wrote: Yes, it is. But that's exactly the point. It works fine for this. But does not work for VisualEditor. It only detects repetition error. Where can we test your VisualEditor integration? Have you tried debugging the HTTP requests using Firebug? Regards

Re: Fwd: Re[2]: Fwd: Re: pt_PT - spelling.txt

2015-07-27 Thread Daniel Naber
On 2015-07-27 21:27, Marco A.G.Pinto wrote: There are some issues and, since I have no idea of how to fix it, I decided to forward it to the mailing list. I've never seen that error before, we'll need instructions on how to reproduce it in order to fix it. Regards Daniel

Re: pt_PT - spelling.txt

2015-07-22 Thread Daniel Naber
On 2015-07-16 14:05, Marco A.G.Pinto wrote: Is there something that can be fixed in the developers side or should I leave out words with spaces? You could try using AbstractSimpleReplaceRule2 instead of AbstractSimpleReplaceRule, according to its javadoc it supports words with spaces.

help with testing a new Firefox add-on

2015-07-15 Thread Daniel Naber
Hi, I've co-authored a Firefox add-on that automatically selects the spell checker dictionary depending on the language you type. The add-on has just been released and now we need feedback. Please give it a try and send me reports of problems, ideas, or success:

Re: pt_PT - Added: replace.txt

2015-07-11 Thread Daniel Naber
On 2015-07-10 20:18, Marco A.G.Pinto wrote: Could someone activate it? After it is activated, I will remove them from grammar.xml . We'll need some strings for that rule: -Message -Description -Short description -Two examples, one incorrect and one correct Also see

Re: Fwd: AOO 5.0 - Portuguese dictionary pre- and -post reform

2015-07-10 Thread Daniel Naber
On 2015-07-10 15:13, Marco A.G.Pinto wrote: After it is implemented, does that mean that finally we will be able to use the compounds pre- and post- pt_PT language agreement like I suggested a year or two ago? My understanding is that if you can select a language and its variant in the

Re: thanking contributors

2015-07-04 Thread Daniel Naber
On 2015-07-04 12:40, Richard Eckart de Castilho wrote: Hi Richard, Not sure if you consider it worth mentioning because it's not a code contribution... sure your contribution is worth mentioning, I've just added you to the page. Regards Daniel

Java 8

2015-07-01 Thread Daniel Naber
Hi, we decided to switch to Java 8 with LT 3.2 (planned release: 2015-12-28). I just needed to parallelize some code in languagetool-dev, and as -dev is not part of the release I decided to make switch for that module now. So if you make changes in languagetool-dev, which only contains

Re: reminder: feature freeze for 3.0

2015-06-24 Thread Daniel Naber
On 2015-06-22 21:05, Daniel Naber wrote: Finally, please help to keep the translations up-to-date: https://www.transifex.com/projects/p/languagetool/ It took some time, but transifex has now been updated and for most languages there are two strings that need translation. Regards

Re: reminder: feature freeze for 3.0

2015-06-24 Thread Daniel Naber
On 2015-06-24 15:16, Marco A.G.Pinto wrote: In Transifex, what is: ngram data directory: (ngram) See http://wiki.languagetool.org/finding-errors-using-big-data and https://en.wikipedia.org/wiki/N-gram Regards Daniel

new committer: Peter Lawrence

2015-06-19 Thread Daniel Naber
Hi, I'd like to welcome Peter Lawrence as a new committer. If you haven't noticed him yet, that's because he has contributed his high-quality rules for English via our forum. We're looking forward to your contributions, Peter! Regards Daniel

reminder: upcoming feature freeze

2015-06-17 Thread Daniel Naber
Hi, this is just a friendly reminder that feature freeze is going to start next Monday, so if you need to make larger changes or changes that need translations, please do so before that date. The release of LT 3.0 is planned for 2015-06-29. http://wiki.languagetool.org/roadmap Regards

Re: Regarding Integration of LanguageTool with VisualEditor

2015-06-06 Thread Daniel Naber
On 2015-06-06 22:42, Ankita Kumari wrote: Here is the access log for the same : https://gist.github.com/ankita-kumari/608f2fe0e05dd75dd59b [5] I think you should first try to fix this problem: There is insufficient memory for the Java Runtime Environment to continue. It means Java isn't

Re: Using the elements 'or' and 'marker' together: possible 'testrules' error

2015-06-01 Thread Daniel Naber
On 2015-05-28 17:05, Mike Unwalla wrote: Testrules gives this error message: Invalid content was found starting with element 'or'. One of '{token, unify, and, phraseref, unify-ignore}' is expected. However, LT seems to operate correctly, as the attached screen shot shows. Am I doing

Re: confusion probability rule now online

2015-05-29 Thread Daniel Naber
On 2015-05-29 18:13, Jan Schreiber wrote: Another word pair came to my mind for which this method might work: prove - proof It works quite well (precision=0.996, recall=0.856), so I've added it. As 'proof' can also be a verb, e.g. short for proofread, there are some cases that are ambiguous,

Re: confusion probability rule now online

2015-05-27 Thread Daniel Naber
On 2015-05-27 19:45, Jan Schreiber wrote: this is really awesome! Is it possible to extend this to abbreviations with an apostrophe? Not yet, but it's on my TODO list. Other important cases: then - than effect - affect I'll try to add these. Regards Daniel

Re: Regarding running LanguageTool HTTP server on Wikimedia Tools-Lab

2015-05-24 Thread Daniel Naber
On 2015-05-24 14:10, Ankita Kumari wrote: Yes, of course. I am talking about its integration with tinymce. That seems neatly done. If there was some documentation on that, it would be helpful. Because I think I will manage till the part where the languagetool server is queried. But I am stuck

Re: Problems with LibreOffice 4.2x on Windows

2015-05-14 Thread Daniel Naber
On 2015-05-14 14:59, Dmitri Gabinski wrote: Could you please confirm that this version also works? https://languagetool.org/download/snapshots/LanguageTool-2.9.1-beta.oxt [1] Yes, it works for me. Thanks for the feedback. I've released this version as 2.9.1 now on

Re: Error while trying to run Server LanguageTool HTTP

2015-05-13 Thread Daniel Naber
On 2015-05-13 16:52, Ankita Kumari wrote: This is the error I get : https://gist.github.com/ankita-kumari/b5da7d7fdae3078dcd88 [1] I have run it without any error before. Please let me know if you can help me out with this. Caused by: java.lang.ClassNotFoundException:

Re: Problems with LibreOffice 4.2x on Windows

2015-05-13 Thread Daniel Naber
On 2015-05-12 15:04, Dmitri Gabinski wrote: The recent fix by Yakov Yakov, thanks! I'm happy to report that yesterday's snapshot runs fine, I can work with a file that previously caused immediate crashes. Could you please confirm that this version also works?

Re: Good way to handle invalid prolonged sound mark

2015-05-13 Thread Daniel Naber
On 2015-05-13 07:43, Takatsugu Nokubi wrote: ー (prolonged sound mark) is a popular symbol in Japanese. And the rule itself is simple: The symbol is placed after Hiragana or Katakana, not Kanji. If the scripts (Hiragana, Katakana, Kanji) have non-overlapping Unicode ranges, it should be

Re: Using result from AnalyzedToken.getLemma()

2015-05-12 Thread Daniel Naber
On 2015-05-12 10:27, Richard Eckart de Castilho wrote: For some wordforms, the lemma appears to contain some structuring, e.g. besitzt becomes [be]sitzen (the brackets are actually in the string returned by getLemma). This is the original data as exported from Morphy:

free JProfiler licenses

2015-05-11 Thread Daniel Naber
Hi, we have now 5 licenses of JProfiler available for LT contributors for their work on LT. It's a powerful Java profiler that can be used for performance and memory analysis. I've used it in the last days to optimize memory when checking German text. If you want a license, let me know.

Re: command-line XML output

2015-05-09 Thread Daniel Naber
On 2015-05-09 11:15, Jaume Ortolà i Font wrote: I need to use the command-line XML output (with the --api option). The list of unkown words is needed but it is missing. Can we add this list to the XML? I have no problem with that. Regards Daniel

Re: Problems with LibreOffice 4.2x on Windows

2015-05-08 Thread Daniel Naber
On 2015-05-08 11:00, Dmitri Gabinski wrote: But why does it matter for LO 4.4.x only? No problem with LO 4.3, neither with AOO 4.1.1. Different users have different (probably memory-related) problems, and all versions between 4.1 to 4.4 seem to be affected more or less. Regards Daniel

Re: de tests fail

2015-05-07 Thread Daniel Naber
On 2015-05-07 22:01, Andriy Rysin wrote: I just checked out fresh copy of LT from git and Germal tests fail with following exception, is this just me or there's a problem? There's a problem, I will fix it soon. Regards Daniel

Re: de tests fail

2015-05-07 Thread Daniel Naber
On 2015-05-07 22:01, Andriy Rysin wrote: I just checked out fresh copy of LT from git and Germal tests fail with following exception, is this just me or there's a problem? Should be fixed now. Regards Daniel --

Re: Last night's LanguageTool nightly diff test

2015-05-06 Thread Daniel Naber
On 2015-05-06 08:46, Marco A.G.Pinto wrote: I haven't received the last night e-mail. There was no email due to server problems. Regards Daniel -- One dashboard for servers and applications across

LT vs. MS Word grammar checker

2015-05-06 Thread Daniel Naber
Hi, I've run a test comparing LT to the grammar checker of MS Word 2013. The result for German: 55% of the errors that LT can detect are *not* found by MS Word. Here's exactly what we did, with a C# program someone wrote for me: -Get all incorrect example sentences from the grammar.xml, the

Re: Multiple zero of min occurances

2015-04-30 Thread Daniel Naber
On 2015-04-29 19:38, Andriy Rysin wrote: Hi Andriy, I wrote little patch with tests that make my rules work but unfortunately it breaks 1 rule in Catalan (where two consequitive min 0s are used). All other languages tests pass. the test case looks correct, please just update the regex

Re: Fwd: LanguageTool nightly diff test

2015-04-25 Thread Daniel Naber
On 2015-04-24 23:38, Marco A.G.Pinto wrote: However, tonight I received another diff that still shows the errors I have fixed. The problem was that it was also triggering the word e instead of a: -LINE 1, COLUMN 86, RULE ID: A_AS2[1] This is a diff, i.e. the lines that start with a -

Java 7 EOL, switching to Java 8?

2015-04-24 Thread Daniel Naber
Hi, Java 7 has now reached its end of life, i.e. there won't be any further security updates publicly available. Thus now it's the right time to discuss how long we're going to support Java 7. The last switch from Java 6 to Java 7 was with version 2.3 on 2013-09-30. Java 8 offers nice

please test languagetool.org on tablets

2015-04-22 Thread Daniel Naber
Hi, if you have a tablet device, could you try languagetool.org on it and post your results here? For example, does clicking on errors and selecting the correction work? Does changing the language work? Does full screen mode work? I tried using Sauce Labs, which is some kind of emulator, but

changes to when Rule.reset() is called

2015-04-21 Thread Daniel Naber
Hi, I made this change to the API: * `getAllRules()`, `getAllActiveRules()`, and `getPatternRulesByIdAndSubId()` in class `JLanguageTool` used to call `reset()` for all rules. This is not the case anymore. `reset()` is now called when one of the `check()` methods is called.

<    1   2   3   4   5   6   7   8   9   10   >