[LanguageTool] [ languagetool-Bugs-3537500 ] ArrayIndexOutOfBoundsException
Bugs item #3537500, was opened at 2012-06-24 02:37 Message generated for change (Comment added) made by dnaber You can respond by visiting: https://sourceforge.net/tracker/?func=detailatid=655717aid=3537500group_id=110216 Please note that this message will contain a full copy of the comment thread, including the initial issue submission, for this request, not just the latest update. Category: None Group: 1.7 Status: Closed Resolution: Works For Me Priority: 5 Private: No Submitted By: Nobody/Anonymous (nobody) Assigned to: Nobody/Anonymous (nobody) Summary: ArrayIndexOutOfBoundsException Initial Comment: Happened just-like-that. I was in the middle of writing some word in LibreOffice Writer and suddenly I got two pop-ups with this java exception (language used - PL). Screen attached. Ofc LanguageTool crashed. -- Comment By: Daniel Naber (dnaber) Date: 2012-06-26 09:39 Message: Marcin points out that this is probably fixed in 1.8, so closing it. -- Comment By: Daniel Naber (dnaber) Date: 2012-06-25 13:03 Message: I have committed a workaround in AnalyzedTokenReadings until the real reason is found. -- Comment By: Marcin Miłkowski (milek_pl) Date: 2012-06-24 07:34 Message: Well, what was the word that you were typing? -- You can respond by visiting: https://sourceforge.net/tracker/?func=detailatid=655717aid=3537500group_id=110216 -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ ___ Languagetool-cvs mailing list Languagetool-cvs@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-cvs
[LanguageTool] SF.net SVN: languagetool:[7527] trunk/JLanguageTool/src/java/org/ languagetool/gui/Main.java
Revision: 7527 http://languagetool.svn.sourceforge.net/languagetool/?rev=7527view=rev Author: dnaber Date: 2012-06-27 16:01:45 + (Wed, 27 Jun 2012) Log Message: --- initialize config before we open the config dialog - fixes loss of configuration Modified Paths: -- trunk/JLanguageTool/src/java/org/languagetool/gui/Main.java Modified: trunk/JLanguageTool/src/java/org/languagetool/gui/Main.java === --- trunk/JLanguageTool/src/java/org/languagetool/gui/Main.java 2012-06-26 21:40:14 UTC (rev 7526) +++ trunk/JLanguageTool/src/java/org/languagetool/gui/Main.java 2012-06-27 16:01:45 UTC (rev 7527) @@ -459,9 +459,9 @@ private JLanguageTool getCurrentLanguageTool(Language currentLanguage) { final JLanguageTool langTool; try { + config = new Configuration(new File(System.getProperty(user.home)), CONFIG_FILE, currentLanguage); final ConfigurationDialog configDialog = getCurrentConfigDialog(currentLanguage); langTool = new JLanguageTool(currentLanguage, configDialog.getMotherTongue()); - config = new Configuration(new File(System.getProperty(user.home)), CONFIG_FILE, currentLanguage); langTool.activateDefaultPatternRules(); langTool.activateDefaultFalseFriendRules(); final SetString disabledRules = configDialog.getDisabledRuleIds(); This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site. -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ ___ Languagetool-cvs mailing list Languagetool-cvs@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-cvs
[LanguageTool] SF.net SVN: languagetool:[7528] trunk/JLanguageTool/src/java/org/ languagetool/gui/ConfigurationDialog.java
Revision: 7528 http://languagetool.svn.sourceforge.net/languagetool/?rev=7528view=rev Author: dnaber Date: 2012-06-27 16:11:35 + (Wed, 27 Jun 2012) Log Message: --- bugfix: enabling rules which were disabled by default didn't work Modified Paths: -- trunk/JLanguageTool/src/java/org/languagetool/gui/ConfigurationDialog.java Modified: trunk/JLanguageTool/src/java/org/languagetool/gui/ConfigurationDialog.java === --- trunk/JLanguageTool/src/java/org/languagetool/gui/ConfigurationDialog.java 2012-06-27 16:01:45 UTC (rev 7527) +++ trunk/JLanguageTool/src/java/org/languagetool/gui/ConfigurationDialog.java 2012-06-27 16:11:35 UTC (rev 7528) @@ -147,8 +147,12 @@ checkBox.setSelected(true); } -if (rule.isDefaultOff() !enabledRuleIds.contains(rule.getId())) { - checkBox.setSelected(false); +if (rule.isDefaultOff()) { + if (enabledRuleIds.contains(rule.getId())) { +checkBox.setSelected(true); + } else { +checkBox.setSelected(false); + } } if (rule.isDefaultOff()) { This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site. -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ ___ Languagetool-cvs mailing list Languagetool-cvs@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-cvs
[LanguageTool] SF.net SVN: languagetool:[7529] trunk/JLanguageTool/src/rules/eo/grammar. xml
Revision: 7529 http://languagetool.svn.sourceforge.net/languagetool/?rev=7529view=rev Author: dominikoeo Date: 2012-06-27 17:47:32 + (Wed, 27 Jun 2012) Log Message: --- [eo] fixed false positives and removed a rule which was wrong. Modified Paths: -- trunk/JLanguageTool/src/rules/eo/grammar.xml Modified: trunk/JLanguageTool/src/rules/eo/grammar.xml === --- trunk/JLanguageTool/src/rules/eo/grammar.xml2012-06-27 16:11:35 UTC (rev 7528) +++ trunk/JLanguageTool/src/rules/eo/grammar.xml2012-06-27 17:47:32 UTC (rev 7529) @@ -430,15 +430,6 @@ example type=incorrectmarkersendifina modo/marker./example example type=correctinfinitivo/example /rule -rule id=JAM_FUTURO name=jam + futuro - pattern -tokenjam/token -token postag=V .. os postag_regexp=yes/ - /pattern - message«jam + futuro» estas iafoje uzata en aparta evitinda senco laŭ Revo. Ĝi povas signifi «fine, pli-malpli, baldaŭ»./message - example type=incorrectKun la tempo, ili markerjam solidiĝos/marker./example - example type=correctKun la tempo, ili fine solidiĝos./example -/rule rule id=KODIGO name=kodigo pattern marker @@ -1636,7 +1627,7 @@ /token marker token postag=[OA] akz.*|T akz .*[uoa] postag_regexp=yes -exception inflected=yes regexp=yes(duono?)?(jarcent|jardek|jar|somer|vintr|printemp|aŭtun|sezon|monat|semajn|tag|maten|vesper|(post?)tagmez|nokt|hor|minut|sekund|moment|.*metr|mejl|marmejl|fut|col|paŝ|lund|mard|merkred|ĵaŭd|vendred|sabat|dimanĉ|foj|temp|.*gram|funt)o?(fin)?(e[tg])?o/exception +exception inflected=yes regexp=yes(duono?)?(jarcent|jardek|jar|somer|vintr|printemp|aŭtun|sezon|monat|semajn|tag|maten|vesper|(post?)tagmez|nokt|hor|minut|sekund|moment|.*metr|mejl|marmejl|fut|col|paŝ|lund|mard|merkred|ĵaŭd|vendred|sabat|dimanĉ|foj|temp|.*gram|funt|tun)o?(fin)?(e[tg])?o/exception exception regexp=yes([0-9]+-?an)|(\p{Lu}.*)/exception exception postag=T akz .. k .* postag_regexp=yes/ /token @@ -1713,7 +1704,7 @@ token postag=V t[rn] .* postag_regexp=yes skip=2 exception scope=next postag=[KP].*|[RO] akz.*|T .* k .*|V nt .* postag_regexp=yes/ exception scope=next regexp=yes[!?-–,;:«»“()]|ĉu|des|se|ol|ke|,/exception - exception inflected=yes regexp=yesnomi|konsideri|kredi|trovi|opinii|taksi|senti/exception + exception inflected=yes regexp=yesnomi|konsideri|kredi|trovi|opinii|taksi|senti|signifi/exception /token marker token postag=[ROAT] nak.* postag_regexp=yes This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site. -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ ___ Languagetool-cvs mailing list Languagetool-cvs@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-cvs
[LanguageTool] SF.net SVN: languagetool:[7530] trunk/JLanguageTool
Revision: 7530 http://languagetool.svn.sourceforge.net/languagetool/?rev=7530view=rev Author: milek_pl Date: 2012-06-27 18:30:34 + (Wed, 27 Jun 2012) Log Message: --- bugfix update of morfologik-speller, update Catalan dict Modified Paths: -- trunk/JLanguageTool/libs/morfologik-speller-1.5.4.jar trunk/JLanguageTool/src/resource/ca/hunspell/ca_ES.dict Modified: trunk/JLanguageTool/libs/morfologik-speller-1.5.4.jar === (Binary files differ) Modified: trunk/JLanguageTool/src/resource/ca/hunspell/ca_ES.dict === (Binary files differ) This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site. -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ ___ Languagetool-cvs mailing list Languagetool-cvs@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-cvs
[LanguageTool] SF.net SVN: languagetool:[7531] trunk/JLanguageTool
Revision: 7531 http://languagetool.svn.sourceforge.net/languagetool/?rev=7531view=rev Author: jaumeortola Date: 2012-06-27 19:21:00 + (Wed, 27 Jun 2012) Log Message: --- [ca] Added source of spellcheck dictionary Modified Paths: -- trunk/JLanguageTool/CHANGES.txt trunk/JLanguageTool/src/rules/ca/grammar.xml Modified: trunk/JLanguageTool/CHANGES.txt === --- trunk/JLanguageTool/CHANGES.txt 2012-06-27 18:30:34 UTC (rev 7530) +++ trunk/JLanguageTool/CHANGES.txt 2012-06-27 19:21:00 UTC (rev 7531) @@ -30,6 +30,7 @@ -new synthesizer (given a part-of-speech, this generates the inflected word forms) (Jaume Ortolà i Font) -added spellcheck dictionary for the standalone version +using Hunspell dictionary ca-valencia (avl) 2.3.0 by softcatala -Russian: -fixed a few false alarms Modified: trunk/JLanguageTool/src/rules/ca/grammar.xml === --- trunk/JLanguageTool/src/rules/ca/grammar.xml2012-06-27 18:30:34 UTC (rev 7530) +++ trunk/JLanguageTool/src/rules/ca/grammar.xml2012-06-27 19:21:00 UTC (rev 7531) @@ -659,7 +659,7 @@ /unify /marker !-- excloure totes les expressions temporals. Falta: els darrers vint anys, els caps de setmana... la major part, la majoria -- -token regexp=yes negate=yes\d+exception inflected=yes regexp=yesvegada|volta|cop|any|dia|mes|segle|setmana|gener|febrer|març|abril|maig|juny|juliol|agost|setembre|octubre|novembre|desembre/exception/token +token regexp=yes negate=yes[\d-]+exception inflected=yes regexp=yesvegada|volta|cop|any|dia|mes|segle|setmana|gener|febrer|març|abril|maig|juny|juliol|agost|setembre|octubre|novembre|desembre/exception/token tokenexception inflected=yes regexp=yesvegada|volta|cop|any|dia|mes|segle|setmana|gener|febrer|març|abril|maig|juny|juliol|agost|setembre|octubre|novembre|desembre/exception/token !-- excepcions: una altra vegada, els caps de setmana... -- tokenexception inflected=yes regexp=yesvegada|volta|cop|any|dia|mes|segle|setmana|gener|febrer|març|abril|maig|juny|juliol|agost|setembre|octubre|novembre|desembre/exception/token tokenexception inflected=yes regexp=yesvegada|volta|cop|any|dia|mes|segle|setmana/exception/token @@ -680,8 +680,6 @@ token postag=D[DAI].* postag_regexp=yesexception postag=RG|_GV_ postag_regexp=yes/exception regexp=yes\p{Lu}[\p{Ll}·]*/exception/token /unify /marker - -token regexp=yes negate=yes\d+/token /pattern messageError de concordança./message shortError de concordança./short This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site. -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ ___ Languagetool-cvs mailing list Languagetool-cvs@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-cvs
[LanguageTool] SF.net SVN: languagetool:[7532] trunk/JLanguageTool/src
Revision: 7532 http://languagetool.svn.sourceforge.net/languagetool/?rev=7532view=rev Author: dnaber Date: 2012-06-27 19:25:01 + (Wed, 27 Jun 2012) Log Message: --- [de] fixing some false alarms Modified Paths: -- trunk/JLanguageTool/src/java/org/languagetool/rules/de/AgreementRule.java trunk/JLanguageTool/src/java/org/languagetool/rules/de/CaseRule.java trunk/JLanguageTool/src/resource/de/added.txt trunk/JLanguageTool/src/resource/de/words-correct.txt trunk/JLanguageTool/src/resource/en/errors.txt trunk/JLanguageTool/src/rules/de/grammar.xml Modified: trunk/JLanguageTool/src/java/org/languagetool/rules/de/AgreementRule.java === --- trunk/JLanguageTool/src/java/org/languagetool/rules/de/AgreementRule.java 2012-06-27 19:21:00 UTC (rev 7531) +++ trunk/JLanguageTool/src/java/org/languagetool/rules/de/AgreementRule.java 2012-06-27 19:25:01 UTC (rev 7532) @@ -186,8 +186,8 @@ if (i 0) { final String prevToken = tokens[i-1].getToken().toLowerCase(); if ((prevToken.equals(der) || prevToken.equals(die) || prevToken.equals(das)) - tokens[i].getToken().equals(eine)) { - // TODO: der eine Polizist - nicht ignorieren, sondern der polizist checken + (tokens[i].getToken().equals(eine) || tokens[i].getToken().equals(einen))) { + // TODO: der eine Polizist - nicht ignorieren, sondern der polizist checken; auf der einen Seite ignore = true; } } Modified: trunk/JLanguageTool/src/java/org/languagetool/rules/de/CaseRule.java === --- trunk/JLanguageTool/src/java/org/languagetool/rules/de/CaseRule.java 2012-06-27 19:21:00 UTC (rev 7531) +++ trunk/JLanguageTool/src/java/org/languagetool/rules/de/CaseRule.java 2012-06-27 19:25:01 UTC (rev 7532) @@ -79,6 +79,7 @@ private static final SetString exceptions = new HashSetString(); static { +exceptions.add(Hr); // Hr. = Abkürzung für Herr exceptions.add(Gesandter); exceptions.add(Durchfahrt); exceptions.add(Durchgriff); Modified: trunk/JLanguageTool/src/resource/de/added.txt === --- trunk/JLanguageTool/src/resource/de/added.txt 2012-06-27 19:21:00 UTC (rev 7531) +++ trunk/JLanguageTool/src/resource/de/added.txt 2012-06-27 19:25:01 UTC (rev 7532) @@ -23,3 +23,11 @@ Taxis TaxiSUB:GEN:PLU:NEU Taxis TaxiSUB:DAT:PLU:NEU Taxis TaxiSUB:AKK:PLU:NEU +Diagramm DiagrammSUB:NOM:SIN:NEU +Diagramms DiagrammSUB:GEN:SIN:NEU +Diagramm DiagrammSUB:DAT:SIN:NEU +Diagramm DiagrammSUB:AKK:SIN:NEU +Diagramme DiagrammSUB:NOM:PLU:NEU +Diagramme DiagrammSUB:GEN:PLU:NEU +Diagrammen DiagrammSUB:DAT:PLU:NEU +Diagramme DiagrammSUB:AKK:PLU:NEU Modified: trunk/JLanguageTool/src/resource/de/words-correct.txt === --- trunk/JLanguageTool/src/resource/de/words-correct.txt 2012-06-27 19:21:00 UTC (rev 7531) +++ trunk/JLanguageTool/src/resource/de/words-correct.txt 2012-06-27 19:25:01 UTC (rev 7532) @@ -1041,3 +1041,12 @@ pseudonymisieren Unfallhergängen Lexikons +zielführend +zuordenbar +Morphem +Morpheme +Morphems +forschungsrelevanter +patientenbezogener +pseudonymisierter +Unumkehrbarkeit Modified: trunk/JLanguageTool/src/resource/en/errors.txt === --- trunk/JLanguageTool/src/resource/en/errors.txt 2012-06-27 19:21:00 UTC (rev 7531) +++ trunk/JLanguageTool/src/resource/en/errors.txt 2012-06-27 19:25:01 UTC (rev 7532) @@ -1,6 +1,7 @@ -- Real world errors: -- +...which is twice as slower[slow] as other rules. Bad us[use] of interpunction. Please make me correct[correct me] If I'm wrong You an[can] set the mapred.child.java.opts on a per job basis. Modified: trunk/JLanguageTool/src/rules/de/grammar.xml === --- trunk/JLanguageTool/src/rules/de/grammar.xml2012-06-27 19:21:00 UTC (rev 7531) +++ trunk/JLanguageTool/src/rules/de/grammar.xml2012-06-27 19:25:01 UTC (rev 7532) @@ -6124,7 +6124,21 @@ !-- Siehe auch die Regel ZU_ZUMUTEN. -- rule pattern -token postag=.*PRP:.* postag_regexp=yesexception scope=currentgegenüber/exceptionexception scope=currentanstatt/exceptionexception scope=currentgleich/exceptionexception scope=currentnach/exceptionexception scope=currentohne/exceptionexception scope=currententsprechend/exceptionexception scope=currentum/exceptionexception scope=currentals/exceptionexception scope=currentan/exceptionexception
[LanguageTool] SF.net SVN: languagetool:[7533] trunk/JLanguageTool/src
Revision: 7533 http://languagetool.svn.sourceforge.net/languagetool/?rev=7533view=rev Author: dnaber Date: 2012-06-27 19:56:28 + (Wed, 27 Jun 2012) Log Message: --- adding tests; small code/javadoc cleanup Modified Paths: -- trunk/JLanguageTool/src/java/org/languagetool/Language.java trunk/JLanguageTool/src/test/org/languagetool/LanguageTest.java Modified: trunk/JLanguageTool/src/java/org/languagetool/Language.java === --- trunk/JLanguageTool/src/java/org/languagetool/Language.java 2012-06-27 19:25:01 UTC (rev 7532) +++ trunk/JLanguageTool/src/java/org/languagetool/Language.java 2012-06-27 19:56:28 UTC (rev 7533) @@ -28,6 +28,7 @@ import java.util.MissingResourceException; import java.util.ResourceBundle; +import org.languagetool.databroker.ResourceDataBroker; import org.languagetool.language.AmericanEnglish; import org.languagetool.language.Asturian; import org.languagetool.language.AustralianEnglish; @@ -172,13 +173,14 @@ /** * Get this language's two character code, e.g. codeen/code for English. - * @return String - language code + * The variant (US), if any, is not returned. + * @return language code */ public abstract String getShortName(); /** * Get this language's name in English, e.g. codeEnglish/code or codeGerman/code. - * @return String - language name + * @return language name */ public abstract String getName(); @@ -208,26 +210,28 @@ // - /** - * Get the location of the rule file. + * Get the location of the rule file(s). */ public ListString getRuleFileName() { - ListString ruleFiles = new ArrayListString(); - ruleFiles.add(JLanguageTool.getDataBroker().getRulesDir() + - / + getShortName() + / + JLanguageTool.PATTERN_FILE); - if (getShortNameWithVariant().length() 2) { - final String fileName = getShortName() + / + - getShortNameWithVariant() + - / + JLanguageTool.PATTERN_FILE; - if (JLanguageTool.getDataBroker().ruleFileExists(fileName)) { - ruleFiles.add(JLanguageTool.getDataBroker().getRulesDir() + - / + fileName); - } +final ListString ruleFiles = new ArrayListString(); +final ResourceDataBroker dataBroker = JLanguageTool.getDataBroker(); +ruleFiles.add(dataBroker.getRulesDir() ++ / + getShortName() + / + JLanguageTool.PATTERN_FILE); +if (getShortNameWithVariant().length() 2) { + final String fileName = getShortName() + / + + getShortNameWithVariant() + + / + JLanguageTool.PATTERN_FILE; + if (dataBroker.ruleFileExists(fileName)) { +ruleFiles.add(dataBroker.getRulesDir() + / + fileName); } - return ruleFiles; +} +return ruleFiles; } /** * Languages that have variants need to overwrite this to select their most common variant. + * @return default variant or codenull/code + * @since 1.8 */ public Language getDefaultVariant() { return null; @@ -305,6 +309,7 @@ * Get the short name of the language with a country variant, if it is * a single-variant language. For generic language classes, get only a two- or * three-character code. + * @since 1.8 */ public final String getShortNameWithVariant() { String name = getShortName(); @@ -357,7 +362,7 @@ /** * Return all languages supported by LanguageTool. - * @return A list of all languages, including external ones. + * @return A list of all languages, including external ones and variants (e.g. en-US) */ public static ListLanguage getAllLanguages() { final ListLanguage langList = new ArrayListLanguage(); @@ -420,6 +425,7 @@ * Get the best match for a locale, using American English as the final fallback if nothing * else fits. The returned language will be a variant language (e.g. British English, not just English) * if available. + * @since 1.8 */ public static Language getLanguageForLocale(final Locale locale) { Language language = getLanguageForLanguageNameAndCountry(locale); @@ -473,11 +479,8 @@ /** * Get sorted info about all maintainers (without language variants) to be used in the About dialog. * @since 0.9.9 - * @param messages - *{{@link ResourceBundle} language bundle to translate - *the info - * @return - *A sorted list of maintainers. + * @param messages {{@link ResourceBundle} language bundle to translate the info + * @return A list of maintainers, sorted by name of language. */ public static String getAllMaintainers(final ResourceBundle messages) { final StringBuilder maintainersInfo = new StringBuilder(); @@ -505,6 +508,7 @@ /** * Whether this is a variant of another language,
[LanguageTool] SF.net SVN: languagetool:[7535] trunk/JLanguageTool/src/java/org/ languagetool/language/Russian.java
Revision: 7535 http://languagetool.svn.sourceforge.net/languagetool/?rev=7535view=rev Author: yakovru Date: 2012-06-27 21:02:45 + (Wed, 27 Jun 2012) Log Message: --- [ru] add morfologik-speller dictionary for Russian Modified Paths: -- trunk/JLanguageTool/src/java/org/languagetool/language/Russian.java Modified: trunk/JLanguageTool/src/java/org/languagetool/language/Russian.java === --- trunk/JLanguageTool/src/java/org/languagetool/language/Russian.java 2012-06-27 21:02:21 UTC (rev 7534) +++ trunk/JLanguageTool/src/java/org/languagetool/language/Russian.java 2012-06-27 21:02:45 UTC (rev 7535) @@ -34,7 +34,8 @@ import org.languagetool.rules.ru.RussianSimpleReplaceRule; import org.languagetool.rules.ru.RussianUnpairedBracketsRule; import org.languagetool.rules.ru.RussianWordRepeatRule; -import org.languagetool.rules.spelling.hunspell.HunspellNoSuggestionRule; +//import org.languagetool.rules.spelling.hunspell.HunspellNoSuggestionRule; +import org.languagetool.rules.ru.MorfologikRussianSpellerRule; import org.languagetool.synthesis.Synthesizer; import org.languagetool.synthesis.ru.RussianSynthesizer; import org.languagetool.tagging.Tagger; @@ -123,7 +124,8 @@ CommaWhitespaceRule.class, DoublePunctuationRule.class, UppercaseSentenceStartRule.class, -HunspellNoSuggestionRule.class, + // HunspellNoSuggestionRule.class, +MorfologikRussianSpellerRule.class, WordRepeatRule.class, WhitespaceRule.class, // specific to Russian : This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site. -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ ___ Languagetool-cvs mailing list Languagetool-cvs@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-cvs
[LanguageTool] SF.net SVN: languagetool:[7537] trunk/JLanguageTool/src
Revision: 7537 http://languagetool.svn.sourceforge.net/languagetool/?rev=7537view=rev Author: dnaber Date: 2012-06-27 21:50:50 + (Wed, 27 Jun 2012) Log Message: --- small code cleanup Modified Paths: -- trunk/JLanguageTool/src/java/org/languagetool/language/LanguageBuilder.java trunk/JLanguageTool/src/java/org/languagetool/rules/AbstractSimpleReplaceRule.java trunk/JLanguageTool/src/java/org/languagetool/rules/en/BritishReplaceRule.java trunk/JLanguageTool/src/test/org/languagetool/rules/en/BritishReplaceRuleTest.java trunk/JLanguageTool/src/test/org/languagetool/rules/en/MorfologikAmericanSpellerRuleTest.java trunk/JLanguageTool/src/test/org/languagetool/rules/en/MorfologikAustralianSpellerRuleTest.java trunk/JLanguageTool/src/test/org/languagetool/rules/en/MorfologikBritishSpellerRuleTest.java trunk/JLanguageTool/src/test/org/languagetool/rules/en/MorfologikCanadianSpellerRuleTest.java trunk/JLanguageTool/src/test/org/languagetool/rules/en/MorfologikNewZealandSpellerRuleTest.java trunk/JLanguageTool/src/test/org/languagetool/rules/en/MorfologikSouthAfricanSpellerRuleTest.java Modified: trunk/JLanguageTool/src/java/org/languagetool/language/LanguageBuilder.java === --- trunk/JLanguageTool/src/java/org/languagetool/language/LanguageBuilder.java 2012-06-27 21:04:50 UTC (rev 7536) +++ trunk/JLanguageTool/src/java/org/languagetool/language/LanguageBuilder.java 2012-06-27 21:50:50 UTC (rev 7537) @@ -75,14 +75,16 @@ } @Override public String getShortName() { -if (parts[1].length() == 2) -return parts[1]; +if (parts[1].length() == 2) { + return parts[1]; +} return parts[1].split(_)[0]; //en as in en_US } @Override public String[] getCountryVariants() { -if (parts[1].length() == 2) -return new String[] {}; +if (parts[1].length() == 2) { + return new String[] {}; +} return new String[] {parts[1].split(_)[1]}; //US as in en_US } @Override @@ -95,7 +97,7 @@ } @Override public ListString getRuleFileName() { - ListString ruleFiles = new ArrayListString(); + final ListString ruleFiles = new ArrayListString(); ruleFiles.add(file.getAbsolutePath()); return ruleFiles; } Modified: trunk/JLanguageTool/src/java/org/languagetool/rules/AbstractSimpleReplaceRule.java === --- trunk/JLanguageTool/src/java/org/languagetool/rules/AbstractSimpleReplaceRule.java 2012-06-27 21:04:50 UTC (rev 7536) +++ trunk/JLanguageTool/src/java/org/languagetool/rules/AbstractSimpleReplaceRule.java 2012-06-27 21:50:50 UTC (rev 7537) @@ -45,7 +45,7 @@ private static final String FILE_ENCODING = utf-8; - private MapString, String wrongWords; // e.g. вреѿті реѿт - зреѿтою + private final MapString, String wrongWords; // e.g. вреѿті реѿт - зреѿтою public abstract String getFileName(); @@ -102,7 +102,7 @@ final String token = tokens[i].getToken(); final String origToken = token; - final String replacement = isCaseSensitive()?wrongWords.get(token):wrongWords.get(token.toLowerCase(getLocale())); + final String replacement = isCaseSensitive() ? wrongWords.get(token) : wrongWords.get(token.toLowerCase(getLocale())); if (replacement != null) { final String msg = token + getSuggestion() + replacement; final int pos = tokens[i].getStartPos(); Modified: trunk/JLanguageTool/src/java/org/languagetool/rules/en/BritishReplaceRule.java === --- trunk/JLanguageTool/src/java/org/languagetool/rules/en/BritishReplaceRule.java 2012-06-27 21:04:50 UTC (rev 7536) +++ trunk/JLanguageTool/src/java/org/languagetool/rules/en/BritishReplaceRule.java 2012-06-27 21:50:50 UTC (rev 7537) @@ -28,9 +28,6 @@ * A rule that matches words or phrases which should not be used and suggests * correct ones instead. * - * Polish implementations. Loads the list of words from - * coderules/pl/replace.txt/code. - * * @author Marcin Miłkowski */ public class BritishReplaceRule extends AbstractSimpleReplaceRule { Modified: trunk/JLanguageTool/src/test/org/languagetool/rules/en/BritishReplaceRuleTest.java === --- trunk/JLanguageTool/src/test/org/languagetool/rules/en/BritishReplaceRuleTest.java 2012-06-27 21:04:50 UTC (rev 7536) +++ trunk/JLanguageTool/src/test/org/languagetool/rules/en/BritishReplaceRuleTest.java 2012-06-27 21:50:50 UTC (rev 7537) @@ -28,12 +28,6 @@ import org.languagetool.TestTools; import org.languagetool.rules.RuleMatch; -/** - * - * Simple tests for rules/pl/SimpleReplaceRule class - * -
[LanguageTool] SF.net SVN: languagetool:[7538] trunk/JLanguageTool
Revision: 7538 http://languagetool.svn.sourceforge.net/languagetool/?rev=7538view=rev Author: jaumeortola Date: 2012-06-27 22:00:13 + (Wed, 27 Jun 2012) Log Message: --- [ca] Fixed some false alarms. One new pattern rule: EN_LA_QUE. Test if the speller uses the correct dictionary. Modified Paths: -- trunk/JLanguageTool/CHANGES.txt trunk/JLanguageTool/src/resource/ca/disambiguation.xml trunk/JLanguageTool/src/rules/ca/grammar.xml trunk/JLanguageTool/src/test/org/languagetool/rules/ca/MorfologikCatalanSpellerRuleTest.java Modified: trunk/JLanguageTool/CHANGES.txt === --- trunk/JLanguageTool/CHANGES.txt 2012-06-27 21:50:50 UTC (rev 7537) +++ trunk/JLanguageTool/CHANGES.txt 2012-06-27 22:00:13 UTC (rev 7538) @@ -30,7 +30,7 @@ -new synthesizer (given a part-of-speech, this generates the inflected word forms) (Jaume Ortolà i Font) -added spellcheck dictionary for the standalone version -using Hunspell dictionary ca-valencia (avl) 2.3.0 by softcatala +using Hunspell dictionary ca-valencia (avl) 2.3.0 by Softcatalà -Russian: -fixed a few false alarms Modified: trunk/JLanguageTool/src/resource/ca/disambiguation.xml === --- trunk/JLanguageTool/src/resource/ca/disambiguation.xml 2012-06-27 21:50:50 UTC (rev 7537) +++ trunk/JLanguageTool/src/resource/ca/disambiguation.xml 2012-06-27 22:00:13 UTC (rev 7538) @@ -4468,7 +4468,7 @@ token postag=V.[MSI].*|_GV_ postag_regexp=yes negate_pos=yesexception postag=V.[MSI].*|_GV_ postag_regexp=yes//token marker and -token postag=NC.* postag_regexp=yesexception postag=_GN_.* postag_regexp=yes //token +token postag=NC.* postag_regexp=yesexception postag=_GN_.* postag_regexp=yes /exceptioncoma/exception/token token postag=V.[MSI].* postag_regexp=yesexception postag=_GV_ /exception regexp=yes\p{Lu}+[\p{Ll}·]*/exceptionexception regexp=yes[\p{Ll}·]+(e|[aei]sses|[ae]res|era)/exception/token !-- problema: els noms acabats en -e no tenen comprovació de concordança!! -- /and Modified: trunk/JLanguageTool/src/rules/ca/grammar.xml === --- trunk/JLanguageTool/src/rules/ca/grammar.xml2012-06-27 21:50:50 UTC (rev 7537) +++ trunk/JLanguageTool/src/rules/ca/grammar.xml2012-06-27 22:00:13 UTC (rev 7538) @@ -2442,7 +2442,7 @@ example type=incorrectLa possibilitat markerde que/marker ho facis/example example type=correctLa possibilitat markerque/marker ho facis/example /rule -rule id=des_de name=des de/que +rule id=DES_DE name=des de/que pattern marker tokendes/token @@ -2518,6 +2518,17 @@ example type=incorrectmarkerDes de que/marker m'ho vas dir./example example type=correctmarkerDes que/marker m'ho vas dir./example /rule +rule id=EN_LA_QUE name=*en la què +pattern +token postag=SPS00/ +token postag=DA.* postag_regexp=yes/ +tokenquè/token +/pattern +messageCal dir suggestionmatch no=1/ què/suggestion o suggestionmatch no=1/ match no=2/ match no=2 postag=DA..(.). postag_regexp=yes postag_replace=PR0C$1000qual/match/suggestion./message +shortExpressió incorrecta/short +example type=incorrectEl lloc markeren el què/marker el van trobar/example +example type=correctEl lloc markeren què/marker el van trobar/example +/rule /category category name=Repeticions rulegroup id=PHRASE_REPETITION name=Repetició de dues paraules ('a la a la') @@ -2908,7 +2919,8 @@ example type=incorrectmarkerSi/marker que vull./example example type=correctmarkerSí/marker que vull./example /rule -rule +!-- Les regles de concordança fan millor la detecció -- +!-- rule pattern tokensonexception postag=_GN.* postag_regexp=yes/exception/token /pattern @@ -2916,7 +2928,7 @@ shortS'accentua si és del verb ser./short example type=incorrectMés del que markerson/marker./example example type=correctMés del que markersón/marker./example -/rule +/rule -- rule pattern token postag=V.* postag_regexp=yes/token Modified: trunk/JLanguageTool/src/test/org/languagetool/rules/ca/MorfologikCatalanSpellerRuleTest.java === ---
[LanguageTool] SF.net SVN: languagetool:[7539] trunk/JLanguageTool/src
Revision: 7539 http://languagetool.svn.sourceforge.net/languagetool/?rev=7539view=rev Author: dominikoeo Date: 2012-06-28 04:26:31 + (Thu, 28 Jun 2012) Log Message: --- [fr] fixed false positive in: Je l'ai a peine vue. Une des r?\195?\169ponses est correcte. Modified Paths: -- trunk/JLanguageTool/src/resource/fr/disambiguation.xml trunk/JLanguageTool/src/rules/fr/grammar.xml Modified: trunk/JLanguageTool/src/resource/fr/disambiguation.xml === --- trunk/JLanguageTool/src/resource/fr/disambiguation.xml 2012-06-27 22:00:13 UTC (rev 7538) +++ trunk/JLanguageTool/src/resource/fr/disambiguation.xml 2012-06-28 04:26:31 UTC (rev 7539) @@ -1069,6 +1069,15 @@ /pattern disambig postag=A/ /rule +rule id=A_PEINE name=à peine + pattern +tokenà/token + marker +tokenpeine/token + /marker + /pattern + disambig postag=A/ +/rule rulegroup id=TOUT_A_FAIT name=tout à fait rule pattern Modified: trunk/JLanguageTool/src/rules/fr/grammar.xml === --- trunk/JLanguageTool/src/rules/fr/grammar.xml2012-06-27 22:00:13 UTC (rev 7538) +++ trunk/JLanguageTool/src/rules/fr/grammar.xml2012-06-28 04:26:31 UTC (rev 7539) @@ -2716,6 +2716,7 @@ example type=correctIl faut que tu le tues./example example type=correctUn mineur mouillé et barbouillé/example example type=correctIl y a un mois, Benoît XVI a markerinstitué/marker une commission formée de trois cardinaux./example +example type=correctJe l’ai à peine vue./example /rule rulegroup id=NON_V name=pronom sujet + nom/adj rule @@ -2939,7 +2940,7 @@ exception scope=nextet/exception /token token postag=Z . sp?|N . s|R pers suj.* s|R dem.* s postag_regexp=yes/token -token postag=V etre .* postag_regexp=yes/token +token postag=V etre .*s postag_regexp=yes/token marker token postag=V.* ppa .* p|J . p postag_regexp=yes/token /marker @@ -2965,7 +2966,7 @@ /token token postag=Z . sp?|N . s|R pers suj.* s|R dem.* s postag_regexp=yes/token token regexp=yes skip=1ne?/token -token postag=V etre .* postag_regexp=yes/token +token postag=V etre .*s postag_regexp=yes/token token regexp=yespas|point|guère|plus|jamais/token marker token postag=V.* ppa .* p|J . p postag_regexp=yes/token @@ -2984,7 +2985,7 @@ exception scope=nextet/exception /token token postag=Z . sp?|N . s|R pers suj.* s|R dem.* s postag_regexp=yes/token -token postag=V etre .* postag_regexp=yes/token +token postag=V etre .*s postag_regexp=yes/token token postag=A/token marker token postag=V.* ppa .* p|J . p postag_regexp=yes/token @@ -3002,7 +3003,7 @@ /token token postag=Z . sp?|N . s|R pers suj.* s|R dem.* s postag_regexp=yes/token token regexp=yes skip=1ne?/token -token postag=V etre .* postag_regexp=yes/token +token postag=V etre .*s postag_regexp=yes/token token regexp=yespas|point|guère|plus|jamais/token token postag=A/token marker @@ -3019,7 +3020,7 @@ exception scope=next postag=[NR].*sp?|V inf postag_regexp=yes/exception /token token postag=N . p|R pers suj [13].* p|R dem.* p postag_regexp=yes/token -token postag=V etre .* postag_regexp=yes/token +token postag=V etre .*p postag_regexp=yes/token marker token postag=V.* ppa .* s|J . s postag_regexp=yes exception regexp=yesmême|vite/exception @@ -3039,6 +3040,7 @@ example type=correctIls sont même allés faire la guerre./example example type=incorrectCelles-là sont markerfolle/marker./example example type=correctCelles-là sont folles./example +example type=correctUne des réponses est correcte./example /rule rule pattern @@ -3047,7 +3049,7 @@ /token token postag=N . p|R pers suj [13].* p|R dem.* p postag_regexp=yes/token token regexp=yes skip=1ne?/token