[LanguageTool] SF.net SVN: languagetool:[7360] trunk/JLanguageTool/src/rules/eo/grammar. xml
Revision: 7360 http://languagetool.svn.sourceforge.net/languagetool/?rev=7360view=rev Author: dominikoeo Date: 2012-06-16 06:33:15 + (Sat, 16 Jun 2012) Log Message: --- [eo] fixed typo in message + minor rule improvement. Modified Paths: -- trunk/JLanguageTool/src/rules/eo/grammar.xml Modified: trunk/JLanguageTool/src/rules/eo/grammar.xml === --- trunk/JLanguageTool/src/rules/eo/grammar.xml2012-06-16 05:47:49 UTC (rev 7359) +++ trunk/JLanguageTool/src/rules/eo/grammar.xml2012-06-16 06:33:15 UTC (rev 7360) @@ -209,7 +209,7 @@ rulegroup id=AKCENTO_AKCXENTO name=ankcento kaj akĉento rule pattern - token inflected=yes regexp=yes skip=1angla|araba|ĉarma|diversa|loka|fremda|fremdlanda|franca|germana|hebrea|itala|hispana|irlanda|usona|[lmnŝv]ian|terura|havi/token + token inflected=yes regexp=yes skip=1angla|araba|bela|bretona|ĉarma|diversa|loka|fremda|fremdlanda|franca|germana|hebrea|itala|hispana|irlanda|japana|portugala|rusa|skota|usona|[lmnŝv]ian|terura|havi|[mvlŝn]ia/token marker token inflected=yesakcento/token /marker @@ -3122,7 +3122,7 @@ token postag=O .* postag_regexp=yes/ token regexp=yeskiuj?n?/token /pattern -messageOni ne uzas «\3» post «\1». Konsidu uzi suggestion\2 de tiu, \3/suggestion./message +messageOni ne uzas «\3» post «\1». Konsideru uzi suggestion\2 de tiu, \3/suggestion./message urlhttp://bertilow.com/pmeg/gramatiko/subfrazoj/rilataj/kiu_kio_kies/kiu.html#i-52m/url example type=incorrectNe kredu markerties parolon kiu/marker tro multe promesas./example example type=correctNe kredu parolon de tiu, kiu tro multe promesas./example @@ -3134,7 +3134,7 @@ token,/token token regexp=yeskiuj?n?/token /pattern -messageOni ne uzas «\4» post «\1». Konsidu uzi suggestion\2 de tiu, \4/suggestion./message +messageOni ne uzas «\4» post «\1». Konsideru uzi suggestion\2 de tiu, \4/suggestion./message example type=incorrectNe kredu markerties parolon, kiu/marker tro multe promesas./example example type=correctNe kredu parolon de tiu, kiu tro multe promesas./example /rule This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site. -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ ___ Languagetool-cvs mailing list Languagetool-cvs@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-cvs
[LanguageTool] SF.net SVN: languagetool:[7350] trunk/JLanguageTool/src/java/org/ languagetool/language/Catalan.java
Revision: 7350 http://languagetool.svn.sourceforge.net/languagetool/?rev=7350view=rev Author: jaumeortola Date: 2012-06-15 10:04:24 + (Fri, 15 Jun 2012) Log Message: --- Jaume Ortola, new Catalan mantainer. Modified Paths: -- trunk/JLanguageTool/src/java/org/languagetool/language/Catalan.java Modified: trunk/JLanguageTool/src/java/org/languagetool/language/Catalan.java === --- trunk/JLanguageTool/src/java/org/languagetool/language/Catalan.java 2012-06-15 09:55:22 UTC (rev 7349) +++ trunk/JLanguageTool/src/java/org/languagetool/language/Catalan.java 2012-06-15 10:04:24 UTC (rev 7350) @@ -83,7 +83,7 @@ @Override public Contributor[] getMaintainers() { -return new Contributor[] {new Contributor(Ricard Roca)}; +return new Contributor[] {new Contributor(Ricard Roca), new Contributor(Jaume Ortolà) }; } @Override This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site. -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ ___ Languagetool-cvs mailing list Languagetool-cvs@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-cvs
[LanguageTool] [ languagetool-Bugs-3534851 ] LanguageToolGUI --tray option
Bugs item #3534851, was opened at 2012-06-13 05:11 Message generated for change (Tracker Item Submitted) made by celvin You can respond by visiting: https://sourceforge.net/tracker/?func=detailatid=655717aid=3534851group_id=110216 Please note that this message will contain a full copy of the comment thread, including the initial issue submission, for this request, not just the latest update. Category: Interface Group: 1.7 Status: Open Resolution: None Priority: 5 Private: No Submitted By: Brendel Oliver (celvin) Assigned to: Marcin Miłkowski (milek_pl) Summary: LanguageToolGUI --tray option Initial Comment: On the web-page it is stated : use the --tray option to start LanguageTool inside the system tray. After you copy any text to the clipboard, clicking LanguageTool in the system tray will cause the application to open and check the contents of the clipboard automatically This does not seem to work : after copying text to the clipboard and clicking on the LanguageTool icon in the system tray, LanguagTool will start, but it will start as it does normally, that is showing the standard test text and no verification done automatically. I tried this on a Windos 7 64 bit desktop with java 32 bits built 1,6,0_32-b05 Cheers Oliver -- You can respond by visiting: https://sourceforge.net/tracker/?func=detailatid=655717aid=3534851group_id=110216 -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ ___ Languagetool-cvs mailing list Languagetool-cvs@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-cvs
[LanguageTool] [ languagetool-Bugs-3534637 ] footnotes after full stop brake case check
Bugs item #3534637, was opened at 2012-06-12 09:31 Message generated for change (Tracker Item Submitted) made by gulp21-1 You can respond by visiting: https://sourceforge.net/tracker/?func=detailatid=655717aid=3534637group_id=110216 Please note that this message will contain a full copy of the comment thread, including the initial issue submission, for this request, not just the latest update. Category: None Group: None Status: Open Resolution: None Priority: 5 Private: No Submitted By: gulp21 (gulp21-1) Assigned to: Nobody/Anonymous (nobody) Summary: footnotes after full stop brake case check Initial Comment: Example: Das ist ein Satz.1 Außerdem ist das hier auch ein Satz. Außerdem is marked as wrong. -- You can respond by visiting: https://sourceforge.net/tracker/?func=detailatid=655717aid=3534637group_id=110216 -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ ___ Languagetool-cvs mailing list Languagetool-cvs@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-cvs
[LanguageTool] [ languagetool-Feature Requests-3534852 ] GUI: spell check ?
Feature Requests item #3534852, was opened at 2012-06-13 05:15 Message generated for change (Settings changed) made by dnaber You can respond by visiting: https://sourceforge.net/tracker/?func=detailatid=655720aid=3534852group_id=110216 Please note that this message will contain a full copy of the comment thread, including the initial issue submission, for this request, not just the latest update. Category: Interface Improvements (example) Group: Next Release (example) Status: Closed Priority: 5 Private: No Submitted By: Brendel Oliver (celvin) Assigned to: Nobody/Anonymous (nobody) Summary: GUI: spell check ? Initial Comment: it would be great if the LanguagToolGUI could also do a spell check. I suppose to apply the grammar rules, LanguagTool needs to parse the words in the text field and recognize them. The spell checker could be simply saying this word is not recognized without giving any choice to replace. This would already help to find simple typos thanks for considering this Oliver -- Comment By: Daniel Naber (dnaber) Date: 2012-06-14 12:18 Message: This is implemented in LanguageTool 1.8. -- You can respond by visiting: https://sourceforge.net/tracker/?func=detailatid=655720aid=3534852group_id=110216 -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ ___ Languagetool-cvs mailing list Languagetool-cvs@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-cvs
[LanguageTool] SF.net SVN: languagetool:[7361] trunk/JLanguageTool/src/dev/org/ languagetool/dev/index/Indexer.java
Revision: 7361 http://languagetool.svn.sourceforge.net/languagetool/?rev=7361view=rev Author: dnaber Date: 2012-06-16 09:04:14 + (Sat, 16 Jun 2012) Log Message: --- small code cleanup: let the caller close the indexer Modified Paths: -- trunk/JLanguageTool/src/dev/org/languagetool/dev/index/Indexer.java Modified: trunk/JLanguageTool/src/dev/org/languagetool/dev/index/Indexer.java === --- trunk/JLanguageTool/src/dev/org/languagetool/dev/index/Indexer.java 2012-06-16 06:33:15 UTC (rev 7360) +++ trunk/JLanguageTool/src/dev/org/languagetool/dev/index/Indexer.java 2012-06-16 09:04:14 UTC (rev 7361) @@ -86,23 +86,32 @@ } final BufferedReader reader = new BufferedReader(new FileReader(file)); System.out.println(Indexing to directory ' + indexDir + '...); -run(reader, new Indexer(FSDirectory.open(new File(indexDir)), Language.ENGLISH), false); +final FSDirectory directory = FSDirectory.open(new File(indexDir)); +try { + final Indexer indexer = new Indexer(directory, Language.ENGLISH); + try { +run(reader, indexer, false); + } finally { +indexer.close(); + } +} finally { + directory.close(); +} System.out.println(Index complete!); } - public static void run(String content, Directory dir, Language language, boolean isSentence) - throws IOException { + public static void run(String content, Directory dir, Language language, boolean isSentence) throws IOException { final BufferedReader br = new BufferedReader(new StringReader(content)); -run(br, new Indexer(dir, language), isSentence); + final Indexer indexer = new Indexer(dir, language); + try { +run(br, indexer, isSentence); + } finally { +indexer.close(); + } } public static void run(BufferedReader reader, Indexer indexer, boolean isSentence) throws IOException { -try { - indexer.index(reader, isSentence, -1); -} finally { - // TODO: let caller close indexer - indexer.close(); -} +indexer.index(reader, isSentence, -1); } public void index(String content, boolean isSentence, int docCount) throws IOException { @@ -112,25 +121,21 @@ public void index(BufferedReader reader, boolean isSentence, int docCount) throws IOException { String line = ; -int lineNo = 1; while ((line = reader.readLine()) != null) { if (isSentence) { -add(lineNo, -1, line); +add(-1, line); } else { final ListString sentences = sentenceTokenizer.tokenize(line); for (String sentence : sentences) { - add(lineNo, docCount, sentence); - // System.out.println(sentence); + add(docCount, sentence); } } - lineNo++; } } - private void add(int lineNo, int docCount, String sentence) throws IOException { + private void add(int docCount, String sentence) throws IOException { final Document doc = new Document(); doc.add(new Field(PatternRuleQueryBuilder.FIELD_NAME, sentence, Store.YES, Index.ANALYZED)); -// doc.add(new Field(FIELD_LINE, lineNo + , Store.YES, Index.NO)); if (docCount != -1) { doc.add(new Field(docCount, docCount + , Store.YES, Index.NO)); } This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site. -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ ___ Languagetool-cvs mailing list Languagetool-cvs@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-cvs
[LanguageTool] SF.net SVN: languagetool:[7363] trunk/JLanguageTool/src/java/org/ languagetool/tools/StringTools.java
Revision: 7363 http://languagetool.svn.sourceforge.net/languagetool/?rev=7363view=rev Author: dnaber Date: 2012-06-16 09:40:07 + (Sat, 16 Jun 2012) Log Message: --- deprecate streamToString(InputStream), which is not used anyway (but it's public so we deprecate it instead of just deleting it) Modified Paths: -- trunk/JLanguageTool/src/java/org/languagetool/tools/StringTools.java Modified: trunk/JLanguageTool/src/java/org/languagetool/tools/StringTools.java === --- trunk/JLanguageTool/src/java/org/languagetool/tools/StringTools.java 2012-06-16 09:28:02 UTC (rev 7362) +++ trunk/JLanguageTool/src/java/org/languagetool/tools/StringTools.java 2012-06-16 09:40:07 UTC (rev 7363) @@ -105,8 +105,7 @@ * code\n/code * @throws IOException */ - public static String readFile(final InputStream file, final String encoding) - throws IOException { + public static String readFile(final InputStream file, final String encoding) throws IOException { InputStreamReader isr = null; BufferedReader br = null; final StringBuilder sb = new StringBuilder(); @@ -238,6 +237,9 @@ return sb.toString(); } + /** + * @deprecated use {@link #streamToString(java.io.InputStream, String)} instead (deprecated since 1.8) + */ public static String streamToString(final InputStream is) throws IOException { final InputStreamReader isr = new InputStreamReader(is); try { @@ -264,7 +266,7 @@ } /** - * Escapes these characters: less than, bigger than, quote, ampersand. + * Escapes these characters: less than, greater than, quote, ampersand. */ public static String escapeHTML(final String s) { // this version is much faster than using s.replaceAll This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site. -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ ___ Languagetool-cvs mailing list Languagetool-cvs@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-cvs
[LanguageTool] SF.net SVN: languagetool:[7364] trunk/JLanguageTool
Revision: 7364 http://languagetool.svn.sourceforge.net/languagetool/?rev=7364view=rev Author: dnaber Date: 2012-06-16 10:32:00 + (Sat, 16 Jun 2012) Log Message: --- deprecate SentenceTokenizer Modified Paths: -- trunk/JLanguageTool/CHANGES.txt trunk/JLanguageTool/src/java/org/languagetool/tokenizers/SentenceTokenizer.java Modified: trunk/JLanguageTool/CHANGES.txt === --- trunk/JLanguageTool/CHANGES.txt 2012-06-16 09:40:07 UTC (rev 7363) +++ trunk/JLanguageTool/CHANGES.txt 2012-06-16 10:32:00 UTC (rev 7364) @@ -114,6 +114,9 @@ with unification. -Update of morfologik-stemming library to 1.5.3 (bug-fix release). + + -API: Deprecated some methods and the SentenceTokenizer class (SRXSentenceTokenizer should be + used instead) 1.7 (2012-03-25) Modified: trunk/JLanguageTool/src/java/org/languagetool/tokenizers/SentenceTokenizer.java === --- trunk/JLanguageTool/src/java/org/languagetool/tokenizers/SentenceTokenizer.java 2012-06-16 09:40:07 UTC (rev 7363) +++ trunk/JLanguageTool/src/java/org/languagetool/tokenizers/SentenceTokenizer.java 2012-06-16 10:32:00 UTC (rev 7364) @@ -32,6 +32,7 @@ * but considering exceptions (e.g. abbreviations). * * @author Daniel Naber + * @deprecated use {@link SRXSentenceTokenizer} instead (deprecated since LT 1.8) */ public class SentenceTokenizer implements Tokenizer { This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site. -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ ___ Languagetool-cvs mailing list Languagetool-cvs@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-cvs
[LanguageTool] SF.net SVN: languagetool:[7365] trunk/JLanguageTool/src
Revision: 7365 http://languagetool.svn.sourceforge.net/languagetool/?rev=7365view=rev Author: milek_pl Date: 2012-06-16 11:15:05 + (Sat, 16 Jun 2012) Log Message: --- fix ArrayOutOfBounds when a non-existent rule is enabled for STDIN mode Modified Paths: -- trunk/JLanguageTool/src/java/org/languagetool/Main.java trunk/JLanguageTool/src/test/org/languagetool/MainTest.java Modified: trunk/JLanguageTool/src/java/org/languagetool/Main.java === --- trunk/JLanguageTool/src/java/org/languagetool/Main.java 2012-06-16 10:32:00 UTC (rev 7364) +++ trunk/JLanguageTool/src/java/org/languagetool/Main.java 2012-06-16 11:15:05 UTC (rev 7365) @@ -218,7 +218,7 @@ int tmpLineOffset = 0; final ListString unknownWords = new ArrayListString(); StringBuilder sb = new StringBuilder(); -for (int ruleIndex = 0; ruleIndex runCount; ruleIndex++) { +for (int ruleIndex = 0; !rules.isEmpty() ruleIndex runCount; ruleIndex++) { currentRule = rules.get(ruleIndex); int matches = 0; long sentences = 0; Modified: trunk/JLanguageTool/src/test/org/languagetool/MainTest.java === --- trunk/JLanguageTool/src/test/org/languagetool/MainTest.java 2012-06-16 10:32:00 UTC (rev 7364) +++ trunk/JLanguageTool/src/test/org/languagetool/MainTest.java 2012-06-16 11:15:05 UTC (rev 7365) @@ -236,7 +236,20 @@ assertTrue(output.indexOf(Expected text language: English) == 0); assertTrue(output.contains(Rule ID: EN_A_VS_AN)); } + + public void testEnglishFileFakeRuleEnabled() throws Exception { +final String test = Zwuasdac?; +final byte[] b = test.getBytes(); +System.setIn(new ByteArrayInputStream(b)); +String[] args = new String[] {-l, en, -e, FOO_BAR_BLABLA, -}; + +Main.main(args); +String output = new String(this.out.toByteArray()); +assertTrue(output.indexOf(Expected text language: English) == 0); + } + + public void testEnglishFileAPI() throws Exception { final URL url = this.getClass().getResource(ENGLISH_TEST_FILE); final URI uri = new URI (url.toString()); This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site. -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ ___ Languagetool-cvs mailing list Languagetool-cvs@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-cvs
[LanguageTool] SF.net SVN: languagetool:[7367] trunk/JLanguageTool
Revision: 7367 http://languagetool.svn.sourceforge.net/languagetool/?rev=7367view=rev Author: dnaber Date: 2012-06-16 16:22:02 + (Sat, 16 Jun 2012) Log Message: --- OOo/LO: fixed false alarm about word being uppercase when the previous sentence ended with a footnote - Sourceforge bug #3534637 Modified Paths: -- trunk/JLanguageTool/CHANGES.txt trunk/JLanguageTool/src/resource/segment.srx trunk/JLanguageTool/src/test/org/languagetool/tokenizers/SRXSentenceTokenizerTest.java Modified: trunk/JLanguageTool/CHANGES.txt === --- trunk/JLanguageTool/CHANGES.txt 2012-06-16 14:30:27 UTC (rev 7366) +++ trunk/JLanguageTool/CHANGES.txt 2012-06-16 16:22:02 UTC (rev 7367) @@ -65,6 +65,9 @@ -LibreOffice / OpenOffice integration: SingletonFactory now implements XServiceInfo (Stephan Bergmann) - Sourceforge bug #3526635 + + -LibreOffice / OpenOffice integration: Fixed false alarm about word being + uppercase when the previous sentence ended with a footnote - Sourceforge bug #3534637 -LanguageTool supports now separate rules for different local variants of a language, for example American English and British English. To use them from the command line, Modified: trunk/JLanguageTool/src/resource/segment.srx === --- trunk/JLanguageTool/src/resource/segment.srx2012-06-16 14:30:27 UTC (rev 7366) +++ trunk/JLanguageTool/src/resource/segment.srx2012-06-16 16:22:02 UTC (rev 7367) @@ -4504,7 +4504,7 @@ /rule !-- Break rules -- rule break=yes -beforebreak[\.!?…]['||«|\)|\]|\}]?\s+/beforebreak +beforebreak[\.!?…][\u0002|'||«|\)|\]|\}]?\s+/beforebreak afterbreak/afterbreak /rule rule break=yes @@ -4657,7 +4657,7 @@ /rule !-- Break rules -- rule break=yes -beforebreak[\.!?…]['||«|\)|\]|\}]?\s+/beforebreak +beforebreak[\.!?…][\u0002|'||«|\)|\]|\}]?\s+/beforebreak afterbreak/afterbreak /rule rule break=yes @@ -4685,7 +4685,7 @@ /rule !-- Break rules -- rule break=yes -beforebreak[\.!?…]['||«|\)|\]|\}]?\s+/beforebreak +beforebreak[\.!?…][\u0002|'||«|\)|\]|\}]?\s+/beforebreak afterbreak/afterbreak /rule rule break=yes Modified: trunk/JLanguageTool/src/test/org/languagetool/tokenizers/SRXSentenceTokenizerTest.java === --- trunk/JLanguageTool/src/test/org/languagetool/tokenizers/SRXSentenceTokenizerTest.java 2012-06-16 14:30:27 UTC (rev 7366) +++ trunk/JLanguageTool/src/test/org/languagetool/tokenizers/SRXSentenceTokenizerTest.java 2012-06-16 16:22:02 UTC (rev 7367) @@ -19,6 +19,7 @@ package org.languagetool.tokenizers; import junit.framework.TestCase; +import org.languagetool.Language; import org.languagetool.TestTools; /** @@ -30,7 +31,7 @@ private SentenceTokenizer stokenizer = new SRXSentenceTokenizer(en); // accept only \n\n as paragraph: private SentenceTokenizer stokenizer2 = new SRXSentenceTokenizer(en); - + public void setUp() { stokenizer.setSingleLineBreaksMarksParagraph(true); stokenizer2.setSingleLineBreaksMarksParagraph(false); @@ -99,8 +100,26 @@ testSplit(new String[] { It works [really!]. , No doubt. }); testSplit(new String[] { It really(!) works well. }); testSplit(new String[] { It really[!] works well. }); + +testSplit(new String[] { This is a sentence.\u0002 , And this is another one. }); // footnotes in LibOO/OOo look like this } + public void testOfficeFootnoteTokenize() { +for (Language language : Language.REAL_LANGUAGES) { + if (language.getSentenceTokenizer().getClass() != SRXSentenceTokenizer.class) { +continue; + } + if (language == Language.KHMER || language == Language.MALAYALAM || language.getShortName().equals(pt)) { +// TODO: I don't know about these... +continue; + } + final String input = A sentence.\u0002 And another one.; + final SentenceTokenizer tokenizer = new SRXSentenceTokenizer(language.getShortName()); + assertEquals(Sentence not split correctly for + language + : ' + input + ', + [A sentence.\u0002 , And another one.], tokenizer.tokenize(input).toString()); +} + } + private void testSplit(String[] sentences) { TestTools.testSplit(sentences, stokenizer); } This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site. -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ ___
[LanguageTool] SF.net SVN: languagetool:[7368] trunk/JLanguageTool/src
Revision: 7368 http://languagetool.svn.sourceforge.net/languagetool/?rev=7368view=rev Author: dominikoeo Date: 2012-06-16 16:40:39 + (Sat, 16 Jun 2012) Log Message: --- [br] fixed disambiguation of the word o which caused false positive in: Me a rankfe chom d'o gortoz. Modified Paths: -- trunk/JLanguageTool/src/resource/br/disambiguation.xml trunk/JLanguageTool/src/rules/br/grammar.xml Modified: trunk/JLanguageTool/src/resource/br/disambiguation.xml === --- trunk/JLanguageTool/src/resource/br/disambiguation.xml 2012-06-16 16:22:02 UTC (rev 7367) +++ trunk/JLanguageTool/src/resource/br/disambiguation.xml 2012-06-16 16:40:39 UTC (rev 7368) @@ -123,6 +123,14 @@ disambig postag=L a/ /rule /rulegroup +rule id=D_O name=d’o + pattern +tokend’/token +markertokeno/token/marker +token postag=V inf/ + /pattern + disambig postag=R e p 3 obj/ +/rule rulegroup id=EZ_AN name=ez an rule pattern Modified: trunk/JLanguageTool/src/rules/br/grammar.xml === --- trunk/JLanguageTool/src/rules/br/grammar.xml2012-06-16 16:22:02 UTC (rev 7367) +++ trunk/JLanguageTool/src/rules/br/grammar.xml2012-06-16 16:40:39 UTC (rev 7368) @@ -1734,6 +1734,7 @@ example type=correctPetra emaout o c’hortoz?/example example type=correctWarlene e ouiemp/example example type=correctPeogwir e selaouomp sonerezh hag e kanomp./example +example type=correctMe a rankfe chom d’o gortoz./example /rule rule pattern This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site. -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ ___ Languagetool-cvs mailing list Languagetool-cvs@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-cvs
[LanguageTool] SF.net SVN: languagetool:[7369] trunk/JLanguageTool/build.xml
Revision: 7369 http://languagetool.svn.sourceforge.net/languagetool/?rev=7369view=rev Author: dnaber Date: 2012-06-16 18:44:26 + (Sat, 16 Jun 2012) Log Message: --- exclude more hunspell files from the OOo/LO distribution Modified Paths: -- trunk/JLanguageTool/build.xml Modified: trunk/JLanguageTool/build.xml === --- trunk/JLanguageTool/build.xml 2012-06-16 16:40:39 UTC (rev 7368) +++ trunk/JLanguageTool/build.xml 2012-06-16 18:44:26 UTC (rev 7369) @@ -469,7 +469,7 @@ target name=dist depends=ooo-internal description=Builds LibreOffice/ApacheOpenOffice extension zip compress=yes destfile=${dist.dir}/LanguageTool-${version}.oxt basedir=${dist.dir} includes=** excludes=deploy.sh,undeploy.sh,turn.sh,printlocales.jar,**/hunspell/**,${bliki.lib},junit.jar,Rule*.jar, - lucene*.jar,testwikipedia.*,LanguageTool.jar,LanguageToolGUI.jar,*.xsl,${jna.lib} / + lucene*.jar,testwikipedia.*,LanguageTool.jar,LanguageToolGUI.jar,*.xsl,${jna.lib},hunspell-*.jnilib,hunspell-*.dll,libhunspell-*.so / /target target name=fast depends=ooo-internal description=like dist, but without ZIP compression (for development only) This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site. -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ ___ Languagetool-cvs mailing list Languagetool-cvs@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-cvs
[LanguageTool] SF.net SVN: languagetool:[7370] trunk/JLanguageTool/src/rules/br/grammar. xml
Revision: 7370 http://languagetool.svn.sourceforge.net/languagetool/?rev=7370view=rev Author: dominikoeo Date: 2012-06-16 20:46:37 + (Sat, 16 Jun 2012) Log Message: --- [br] fixed false positive in Hor marc'h-houarn Modified Paths: -- trunk/JLanguageTool/src/rules/br/grammar.xml Modified: trunk/JLanguageTool/src/rules/br/grammar.xml === --- trunk/JLanguageTool/src/rules/br/grammar.xml2012-06-16 18:44:26 UTC (rev 7369) +++ trunk/JLanguageTool/src/rules/br/grammar.xml2012-06-16 20:46:37 UTC (rev 7370) @@ -1181,7 +1181,7 @@ pattern tokenhor/token marker -token inflected=yes regexp=yesk.*|marc’hexception regexp=yesc’h.*/exception/token +token inflected=yes regexp=yesk.*|marc’hexception regexp=yesc’h.*|marc’h.*/exception/token /marker /pattern messageKemmadur direizh. Goude ar ger «\1» e tro «\2» da suggestionmatch no=2 regexp_match=.(.*) regexp_replace=c’h$1//suggestion./message @@ -1193,6 +1193,7 @@ example type=incorrectHor markerkezeg/marker./example example type=incorrectHor markergezeg/marker./example example type=correctHor c’hezeg./example +example type=correctHor marc’h-houarn./example /rule rule pattern This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site. -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ ___ Languagetool-cvs mailing list Languagetool-cvs@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-cvs
[LanguageTool] SF.net SVN: languagetool:[7372] trunk/JLanguageTool/src/rules/br/grammar. xml
Revision: 7372 http://languagetool.svn.sourceforge.net/languagetool/?rev=7372view=rev Author: dominikoeo Date: 2012-06-16 21:11:13 + (Sat, 16 Jun 2012) Log Message: --- [br] - fixed typo in message *dra vlotaat* - dre vlotaat - added more references to the grammar from E. CHALM. Modified Paths: -- trunk/JLanguageTool/src/rules/br/grammar.xml Modified: trunk/JLanguageTool/src/rules/br/grammar.xml === --- trunk/JLanguageTool/src/rules/br/grammar.xml2012-06-16 20:47:11 UTC (rev 7371) +++ trunk/JLanguageTool/src/rules/br/grammar.xml2012-06-16 21:11:13 UTC (rev 7372) @@ -712,7 +712,7 @@ /token /marker /pattern -messageUr c’hemmadur dre vlotaat a zlefe bezañ gant an anv-gwan «\2» goude «\1»./message +messageUr c’hemmadur dre vlotaat a zlefe bezañ gant an anv-gwan «\2» goude «\1» (CHALM B3.1.2.4)./message example type=incorrectHemañ markertreut/marker/example example type=incorrectHemañ markerzreut/marker/example example type=correctHemañ dreut/example @@ -731,7 +731,7 @@ /token /marker /pattern -messageUr c’hemmadur dre vlotaat diglok a zlefe bezañ gant an anv-gwan «\2» goude «\1»./message +messageUr c’hemmadur dre vlotaat diglok a zlefe bezañ gant an anv-gwan «\2» goude «\1» (CHALM B3.1.2.3)./message example type=incorrectHennezh markerbihan/marker/example example type=incorrectHennezh markerpihan/marker/example example type=correctHennezh vihan/example @@ -780,7 +780,7 @@ /and /marker /pattern -messageNe zlefe ket bezañ ur c’hemmadur goude ar ger «\1» gant ur ger a grog gant «k». Ha fellout a rae deoc’h skrivañ suggestionmatch no=2 regexp_match=(g|c’h)(.*) regexp_replace=k$2//suggestion?/message +messageNe zlefe ket bezañ ur c’hemmadur goude ar ger «\1» gant ur ger a grog gant «k». Ha fellout a rae deoc’h skrivañ suggestionmatch no=2 regexp_match=(g|c’h)(.*) regexp_replace=k$2//suggestion (CHALM B3.3)?/message example type=incorrectHo markerc’harr/marker./example example type=incorrectHo markergarr/marker./example example type=correctHo markerkarr/marker./example @@ -799,7 +799,7 @@ /and /marker /pattern -messageNe zlefe ket bezañ ur c’hemmadur goude ar ger «\1» gant ur ger a grog gant «t». Ha fellout a rae deoc’h skrivañ suggestionmatch no=2 regexp_match=.(.*) regexp_replace=t$1//suggestion?/message +messageNe zlefe ket bezañ ur c’hemmadur goude ar ger «\1» gant ur ger a grog gant «t». Ha fellout a rae deoc’h skrivañ suggestionmatch no=2 regexp_match=.(.*) regexp_replace=t$1//suggestion (CHALM B3.3)?/message example type=incorrectHo markerdad/marker/example example type=incorrectHo markerzad/marker/example example type=correctHo markertad/marker/example @@ -814,7 +814,7 @@ /and /marker /pattern -messageNe zlefe ket bezañ ur c’hemmadur goude ar ger «\1» gant ur ger a grog gant «t». Ha fellout a rae deoc’h skrivañ suggestionmatch no=2 regexp_match=.(.*) regexp_replace=p$1//suggestion?/message +messageNe zlefe ket bezañ ur c’hemmadur goude ar ger «\1» gant ur ger a grog gant «t». Ha fellout a rae deoc’h skrivañ suggestionmatch no=2 regexp_match=.(.*) regexp_replace=p$1//suggestion (CHALM B.3.3)?/message example type=incorrectHo markerbenn/marker/example example type=incorrectHo markerfenn/marker/example example type=correctHo markerpenn/marker/example @@ -827,7 +827,7 @@ token regexp=yesg.*/token /marker /pattern -messageUr c’hemmadur dre galetaat a zlefe bezañ goude ar ger «\1». Ha fellout a rae deoc’h skrivañ suggestionmatch no=2 regexp_match=.(.*) regexp_replace=k$1//suggestion?/message +messageUr c’hemmadur dre galetaat a zlefe bezañ goude ar ger «\1». Ha fellout a rae deoc’h skrivañ suggestionmatch no=2 regexp_match=.(.*) regexp_replace=k$1//suggestion (CHALM B3.3)?/message example type=incorrectMat eo ho markergwastell/marker./example example type=correctMat eo ho kwastell./example example type=incorrectLous eo ho markergenoù/marker./example @@ -840,7 +840,7 @@ token regexp=yes postag=[^M]*M.*:1:.* postag_regexp=yesw.*/token /marker /pattern -messageUr c’hemmadur dre galetaat a zlefe bezañ goude ar ger «\1». Ha fellout a rae deoc’h skrivañ suggestionk\2/suggestion?/message +messageUr c’hemmadur dre galetaat a zlefe bezañ goude ar ger «\1». Ha fellout a rae deoc’h skrivañ suggestionk\2/suggestion (CHALM B3.3)?/message example type=incorrectHo markerwazh/marker./example example type=correctHo kwazh./example /rule @@ -854,7 +854,7
[LanguageTool] SF.net SVN: languagetool:[7373] trunk/JLanguageTool/src/rules/br/grammar. xml
Revision: 7373 http://languagetool.svn.sourceforge.net/languagetool/?rev=7373view=rev Author: dominikoeo Date: 2012-06-16 21:33:39 + (Sat, 16 Jun 2012) Log Message: --- [br] fixed a few false positives. Modified Paths: -- trunk/JLanguageTool/src/rules/br/grammar.xml Modified: trunk/JLanguageTool/src/rules/br/grammar.xml === --- trunk/JLanguageTool/src/rules/br/grammar.xml2012-06-16 21:11:13 UTC (rev 7372) +++ trunk/JLanguageTool/src/rules/br/grammar.xml2012-06-16 21:33:39 UTC (rev 7373) @@ -399,7 +399,7 @@ and token regexp=yes inflected=yes postag=UNKNOWN negate_pos=yesd.* exception inflected=yesda/exception -exception regexp=yes inflected=yest.*/exception +exception regexp=yes inflected=yest.*|dirak|dindan/exception exception regexp=yes[dt]ud(-(mañ|se|hont))?/exception /token !-- La mutation est faculative dans le verbe dleout d’après @@ -420,6 +420,7 @@ example type=correctAn ti a markerzle/marker bezañ amañ./example example type=incorrectPe markerdeiz/marker e teuy?/example example type=correctPe zeiz e teuy?/example +example type=correctAdsavit ar frazennoù-mañ en ur lakaat re markerdirak/marker an anvioù-gwan./example /rule rule pattern @@ -1054,7 +1055,9 @@ /token marker and - token regexp=yes inflected=yesd.*/token + token regexp=yes inflected=yesd.* +exception regexp=yes inflected=yes[zt].*/exception + /token token regexp=yes[tz].*exception regexp=yes[tz]ud/exception/token /and /marker @@ -1064,6 +1067,7 @@ example type=incorrectTeir markerzimezell/marker/example example type=correctTeir markerdimezell/marker/example example type=correctGant o zud/example +example type=correctTri zu/example /rule rule pattern @@ -1949,7 +1953,7 @@ marker token postag=[^M]*M:.* postag_regexp=yes exception negate_pos=yes postag=[^M]*M:.*|SENT_END postag_regexp=yes/ -exceptionvat/exception +exception regexp=yesva[dt]/exception /token /marker /pattern This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site. -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ ___ Languagetool-cvs mailing list Languagetool-cvs@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-cvs