[LanguageTool] SF.net SVN: languagetool:[7360] trunk/JLanguageTool/src/rules/eo/grammar. xml

2012-06-16 Thread dominikoeo
Revision: 7360
  
http://languagetool.svn.sourceforge.net/languagetool/?rev=7360view=rev
Author:   dominikoeo
Date: 2012-06-16 06:33:15 + (Sat, 16 Jun 2012)
Log Message:
---
[eo] fixed typo in message + minor rule improvement.

Modified Paths:
--
trunk/JLanguageTool/src/rules/eo/grammar.xml

Modified: trunk/JLanguageTool/src/rules/eo/grammar.xml
===
--- trunk/JLanguageTool/src/rules/eo/grammar.xml2012-06-16 05:47:49 UTC 
(rev 7359)
+++ trunk/JLanguageTool/src/rules/eo/grammar.xml2012-06-16 06:33:15 UTC 
(rev 7360)
@@ -209,7 +209,7 @@
 rulegroup id=AKCENTO_AKCXENTO name=ankcento kaj akĉento
   rule
 pattern
-  token inflected=yes regexp=yes 
skip=1angla|araba|ĉarma|diversa|loka|fremda|fremdlanda|franca|germana|hebrea|itala|hispana|irlanda|usona|[lmnŝv]ian|terura|havi/token
+  token inflected=yes regexp=yes 
skip=1angla|araba|bela|bretona|ĉarma|diversa|loka|fremda|fremdlanda|franca|germana|hebrea|itala|hispana|irlanda|japana|portugala|rusa|skota|usona|[lmnŝv]ian|terura|havi|[mvlŝn]ia/token
   marker
 token inflected=yesakcento/token
   /marker
@@ -3122,7 +3122,7 @@
   token postag=O .* postag_regexp=yes/
   token regexp=yeskiuj?n?/token
 /pattern
-messageOni ne uzas «\3» post «\1». Konsidu uzi suggestion\2 de 
tiu, \3/suggestion./message
+messageOni ne uzas «\3» post «\1». Konsideru uzi suggestion\2 de 
tiu, \3/suggestion./message
 
urlhttp://bertilow.com/pmeg/gramatiko/subfrazoj/rilataj/kiu_kio_kies/kiu.html#i-52m/url
 example type=incorrectNe kredu markerties parolon kiu/marker 
tro multe promesas./example
 example type=correctNe kredu parolon de tiu, kiu tro multe 
promesas./example
@@ -3134,7 +3134,7 @@
   token,/token
   token regexp=yeskiuj?n?/token
 /pattern
-messageOni ne uzas «\4» post «\1». Konsidu uzi suggestion\2 de 
tiu, \4/suggestion./message
+messageOni ne uzas «\4» post «\1». Konsideru uzi suggestion\2 de 
tiu, \4/suggestion./message
 example type=incorrectNe kredu markerties parolon, kiu/marker 
tro multe promesas./example
 example type=correctNe kredu parolon de tiu, kiu tro multe 
promesas./example
   /rule

This was sent by the SourceForge.net collaborative development platform, the 
world's largest Open Source development site.


--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
Languagetool-cvs mailing list
Languagetool-cvs@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-cvs


[LanguageTool] SF.net SVN: languagetool:[7350] trunk/JLanguageTool/src/java/org/ languagetool/language/Catalan.java

2012-06-16 Thread jaumeortola
Revision: 7350
  
http://languagetool.svn.sourceforge.net/languagetool/?rev=7350view=rev
Author:   jaumeortola
Date: 2012-06-15 10:04:24 + (Fri, 15 Jun 2012)
Log Message:
---
Jaume Ortola, new Catalan mantainer. 

Modified Paths:
--
trunk/JLanguageTool/src/java/org/languagetool/language/Catalan.java

Modified: trunk/JLanguageTool/src/java/org/languagetool/language/Catalan.java
===
--- trunk/JLanguageTool/src/java/org/languagetool/language/Catalan.java 
2012-06-15 09:55:22 UTC (rev 7349)
+++ trunk/JLanguageTool/src/java/org/languagetool/language/Catalan.java 
2012-06-15 10:04:24 UTC (rev 7350)
@@ -83,7 +83,7 @@
 
   @Override
   public Contributor[] getMaintainers() {
-return new Contributor[] {new Contributor(Ricard Roca)};
+return new Contributor[] {new Contributor(Ricard Roca), new 
Contributor(Jaume Ortolà) };
   }
 
   @Override

This was sent by the SourceForge.net collaborative development platform, the 
world's largest Open Source development site.


--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
Languagetool-cvs mailing list
Languagetool-cvs@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-cvs


[LanguageTool] [ languagetool-Bugs-3534851 ] LanguageToolGUI --tray option

2012-06-16 Thread SourceForge . net
Bugs item #3534851, was opened at 2012-06-13 05:11
Message generated for change (Tracker Item Submitted) made by celvin
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detailatid=655717aid=3534851group_id=110216

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Interface
Group: 1.7
Status: Open
Resolution: None
Priority: 5
Private: No
Submitted By: Brendel Oliver (celvin)
Assigned to: Marcin Miłkowski (milek_pl)
Summary: LanguageToolGUI --tray option

Initial Comment:
On the web-page it is stated : 
use the --tray option to start LanguageTool inside the system tray. After you 
copy any text to the clipboard, clicking LanguageTool in the system tray will 
cause the application to open and check the contents of the clipboard 
automatically

This does not seem to work : after copying text to the clipboard and clicking 
on the LanguageTool icon in the system tray, LanguagTool will start, but it 
will start as it does normally, that is showing the standard test text and no 
verification done automatically. 
I tried this on a Windos 7 64 bit desktop with java 32 bits built 1,6,0_32-b05

Cheers

Oliver


--

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detailatid=655717aid=3534851group_id=110216

--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
Languagetool-cvs mailing list
Languagetool-cvs@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-cvs


[LanguageTool] [ languagetool-Bugs-3534637 ] footnotes after full stop brake case check

2012-06-16 Thread SourceForge . net
Bugs item #3534637, was opened at 2012-06-12 09:31
Message generated for change (Tracker Item Submitted) made by gulp21-1
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detailatid=655717aid=3534637group_id=110216

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: None
Group: None
Status: Open
Resolution: None
Priority: 5
Private: No
Submitted By: gulp21 (gulp21-1)
Assigned to: Nobody/Anonymous (nobody)
Summary: footnotes after full stop brake case check

Initial Comment:
Example: Das ist ein Satz.1 Außerdem ist das hier auch ein Satz.

Außerdem is marked as wrong.

--

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detailatid=655717aid=3534637group_id=110216

--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
Languagetool-cvs mailing list
Languagetool-cvs@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-cvs


[LanguageTool] [ languagetool-Feature Requests-3534852 ] GUI: spell check ?

2012-06-16 Thread SourceForge . net
Feature Requests item #3534852, was opened at 2012-06-13 05:15
Message generated for change (Settings changed) made by dnaber
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detailatid=655720aid=3534852group_id=110216

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Interface Improvements (example)
Group: Next Release (example)
Status: Closed
Priority: 5
Private: No
Submitted By: Brendel Oliver (celvin)
Assigned to: Nobody/Anonymous (nobody)
Summary: GUI: spell check ? 

Initial Comment:
it would be great if the LanguagToolGUI could also do a spell check. 
I suppose to apply the grammar rules, LanguagTool needs to parse the words in 
the text field and recognize them. The spell checker could be simply saying 
this word is not recognized without giving any choice to replace. 
This would already help to find simple typos

thanks for considering this

Oliver


--

Comment By: Daniel Naber (dnaber)
Date: 2012-06-14 12:18

Message:
This is implemented in LanguageTool 1.8.

--

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detailatid=655720aid=3534852group_id=110216

--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
Languagetool-cvs mailing list
Languagetool-cvs@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-cvs


[LanguageTool] SF.net SVN: languagetool:[7361] trunk/JLanguageTool/src/dev/org/ languagetool/dev/index/Indexer.java

2012-06-16 Thread dnaber
Revision: 7361
  
http://languagetool.svn.sourceforge.net/languagetool/?rev=7361view=rev
Author:   dnaber
Date: 2012-06-16 09:04:14 + (Sat, 16 Jun 2012)
Log Message:
---
small code cleanup: let the caller close the indexer

Modified Paths:
--
trunk/JLanguageTool/src/dev/org/languagetool/dev/index/Indexer.java

Modified: trunk/JLanguageTool/src/dev/org/languagetool/dev/index/Indexer.java
===
--- trunk/JLanguageTool/src/dev/org/languagetool/dev/index/Indexer.java 
2012-06-16 06:33:15 UTC (rev 7360)
+++ trunk/JLanguageTool/src/dev/org/languagetool/dev/index/Indexer.java 
2012-06-16 09:04:14 UTC (rev 7361)
@@ -86,23 +86,32 @@
 }
 final BufferedReader reader = new BufferedReader(new FileReader(file));
 System.out.println(Indexing to directory ' + indexDir + '...);
-run(reader, new Indexer(FSDirectory.open(new File(indexDir)), 
Language.ENGLISH), false);
+final FSDirectory directory = FSDirectory.open(new File(indexDir));
+try {
+  final Indexer indexer = new Indexer(directory, Language.ENGLISH);
+  try {
+run(reader, indexer, false);
+  } finally {
+indexer.close();
+  }
+} finally {
+  directory.close();
+}
 System.out.println(Index complete!);
   }
 
-  public static void run(String content, Directory dir, Language language, 
boolean isSentence)
-  throws IOException {
+  public static void run(String content, Directory dir, Language language, 
boolean isSentence) throws IOException {
 final BufferedReader br = new BufferedReader(new StringReader(content));
-run(br, new Indexer(dir, language), isSentence);
+  final Indexer indexer = new Indexer(dir, language);
+  try {
+run(br, indexer, isSentence);
+  } finally {
+indexer.close();
+  }
   }
 
   public static void run(BufferedReader reader, Indexer indexer, boolean 
isSentence) throws IOException {
-try {
-  indexer.index(reader, isSentence, -1);
-} finally {
-  // TODO: let caller close indexer
-  indexer.close();
-}
+indexer.index(reader, isSentence, -1);
   }
 
   public void index(String content, boolean isSentence, int docCount) throws 
IOException {
@@ -112,25 +121,21 @@
 
   public void index(BufferedReader reader, boolean isSentence, int docCount) 
throws IOException {
 String line = ;
-int lineNo = 1;
 while ((line = reader.readLine()) != null) {
   if (isSentence) {
-add(lineNo, -1, line);
+add(-1, line);
   } else {
 final ListString sentences = sentenceTokenizer.tokenize(line);
 for (String sentence : sentences) {
-  add(lineNo, docCount, sentence);
-  // System.out.println(sentence);
+  add(docCount, sentence);
 }
   }
-  lineNo++;
 }
   }
 
-  private void add(int lineNo, int docCount, String sentence) throws 
IOException {
+  private void add(int docCount, String sentence) throws IOException {
 final Document doc = new Document();
 doc.add(new Field(PatternRuleQueryBuilder.FIELD_NAME, sentence, Store.YES, 
Index.ANALYZED));
-// doc.add(new Field(FIELD_LINE, lineNo + , Store.YES, Index.NO));
 if (docCount != -1) {
   doc.add(new Field(docCount, docCount + , Store.YES, Index.NO));
 }

This was sent by the SourceForge.net collaborative development platform, the 
world's largest Open Source development site.


--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
Languagetool-cvs mailing list
Languagetool-cvs@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-cvs


[LanguageTool] SF.net SVN: languagetool:[7363] trunk/JLanguageTool/src/java/org/ languagetool/tools/StringTools.java

2012-06-16 Thread dnaber
Revision: 7363
  
http://languagetool.svn.sourceforge.net/languagetool/?rev=7363view=rev
Author:   dnaber
Date: 2012-06-16 09:40:07 + (Sat, 16 Jun 2012)
Log Message:
---
deprecate streamToString(InputStream), which is not used anyway (but it's 
public so we deprecate it instead of just deleting it)

Modified Paths:
--
trunk/JLanguageTool/src/java/org/languagetool/tools/StringTools.java

Modified: trunk/JLanguageTool/src/java/org/languagetool/tools/StringTools.java
===
--- trunk/JLanguageTool/src/java/org/languagetool/tools/StringTools.java
2012-06-16 09:28:02 UTC (rev 7362)
+++ trunk/JLanguageTool/src/java/org/languagetool/tools/StringTools.java
2012-06-16 09:40:07 UTC (rev 7363)
@@ -105,8 +105,7 @@
* code\n/code
* @throws IOException
*/
-  public static String readFile(final InputStream file, final String encoding)
-  throws IOException {
+  public static String readFile(final InputStream file, final String encoding) 
throws IOException {
 InputStreamReader isr = null;
 BufferedReader br = null;
 final StringBuilder sb = new StringBuilder();
@@ -238,6 +237,9 @@
 return sb.toString();
   }
 
+  /**
+   * @deprecated use {@link #streamToString(java.io.InputStream, String)} 
instead (deprecated since 1.8)
+   */
   public static String streamToString(final InputStream is) throws IOException 
{
 final InputStreamReader isr = new InputStreamReader(is);
 try {
@@ -264,7 +266,7 @@
   }
 
   /**
-   * Escapes these characters: less than, bigger than, quote, ampersand.
+   * Escapes these characters: less than, greater than, quote, ampersand.
*/
   public static String escapeHTML(final String s) {
 // this version is much faster than using s.replaceAll

This was sent by the SourceForge.net collaborative development platform, the 
world's largest Open Source development site.


--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
Languagetool-cvs mailing list
Languagetool-cvs@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-cvs


[LanguageTool] SF.net SVN: languagetool:[7364] trunk/JLanguageTool

2012-06-16 Thread dnaber
Revision: 7364
  
http://languagetool.svn.sourceforge.net/languagetool/?rev=7364view=rev
Author:   dnaber
Date: 2012-06-16 10:32:00 + (Sat, 16 Jun 2012)
Log Message:
---
deprecate SentenceTokenizer

Modified Paths:
--
trunk/JLanguageTool/CHANGES.txt

trunk/JLanguageTool/src/java/org/languagetool/tokenizers/SentenceTokenizer.java

Modified: trunk/JLanguageTool/CHANGES.txt
===
--- trunk/JLanguageTool/CHANGES.txt 2012-06-16 09:40:07 UTC (rev 7363)
+++ trunk/JLanguageTool/CHANGES.txt 2012-06-16 10:32:00 UTC (rev 7364)
@@ -114,6 +114,9 @@
   with unification.
  
  -Update of morfologik-stemming library to 1.5.3 (bug-fix release).
+
+ -API: Deprecated some methods and the SentenceTokenizer class 
(SRXSentenceTokenizer should be
+  used instead)
  
 
 1.7 (2012-03-25)

Modified: 
trunk/JLanguageTool/src/java/org/languagetool/tokenizers/SentenceTokenizer.java
===
--- 
trunk/JLanguageTool/src/java/org/languagetool/tokenizers/SentenceTokenizer.java 
2012-06-16 09:40:07 UTC (rev 7363)
+++ 
trunk/JLanguageTool/src/java/org/languagetool/tokenizers/SentenceTokenizer.java 
2012-06-16 10:32:00 UTC (rev 7364)
@@ -32,6 +32,7 @@
  * but considering exceptions (e.g. abbreviations).
  *
  * @author Daniel Naber
+ * @deprecated use {@link SRXSentenceTokenizer} instead (deprecated since LT 
1.8)
  */
 public class SentenceTokenizer implements Tokenizer {
 

This was sent by the SourceForge.net collaborative development platform, the 
world's largest Open Source development site.


--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
Languagetool-cvs mailing list
Languagetool-cvs@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-cvs


[LanguageTool] SF.net SVN: languagetool:[7365] trunk/JLanguageTool/src

2012-06-16 Thread milek_pl
Revision: 7365
  
http://languagetool.svn.sourceforge.net/languagetool/?rev=7365view=rev
Author:   milek_pl
Date: 2012-06-16 11:15:05 + (Sat, 16 Jun 2012)
Log Message:
---
fix ArrayOutOfBounds when a non-existent rule is enabled for STDIN mode

Modified Paths:
--
trunk/JLanguageTool/src/java/org/languagetool/Main.java
trunk/JLanguageTool/src/test/org/languagetool/MainTest.java

Modified: trunk/JLanguageTool/src/java/org/languagetool/Main.java
===
--- trunk/JLanguageTool/src/java/org/languagetool/Main.java 2012-06-16 
10:32:00 UTC (rev 7364)
+++ trunk/JLanguageTool/src/java/org/languagetool/Main.java 2012-06-16 
11:15:05 UTC (rev 7365)
@@ -218,7 +218,7 @@
 int tmpLineOffset = 0;
 final ListString unknownWords = new ArrayListString();
 StringBuilder sb = new StringBuilder();
-for (int ruleIndex = 0; ruleIndex  runCount; ruleIndex++) {
+for (int ruleIndex = 0; !rules.isEmpty()  ruleIndex  runCount; 
ruleIndex++) {
   currentRule = rules.get(ruleIndex);
   int matches = 0;
   long sentences = 0;

Modified: trunk/JLanguageTool/src/test/org/languagetool/MainTest.java
===
--- trunk/JLanguageTool/src/test/org/languagetool/MainTest.java 2012-06-16 
10:32:00 UTC (rev 7364)
+++ trunk/JLanguageTool/src/test/org/languagetool/MainTest.java 2012-06-16 
11:15:05 UTC (rev 7365)
@@ -236,7 +236,20 @@
 assertTrue(output.indexOf(Expected text language: English) == 0);
 assertTrue(output.contains(Rule ID: EN_A_VS_AN));
   }
+
+  public void testEnglishFileFakeRuleEnabled() throws Exception {
+final String test = Zwuasdac?;
+final byte[] b = test.getBytes();
+System.setIn(new ByteArrayInputStream(b));
+String[] args = new String[] {-l, en, -e, FOO_BAR_BLABLA, -};
+
+Main.main(args);
+String output = new String(this.out.toByteArray());
+assertTrue(output.indexOf(Expected text language: English) == 0);
+  }
+
   
+  
   public void testEnglishFileAPI() throws Exception {
 final URL url = this.getClass().getResource(ENGLISH_TEST_FILE);
 final URI uri = new URI (url.toString());

This was sent by the SourceForge.net collaborative development platform, the 
world's largest Open Source development site.


--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
Languagetool-cvs mailing list
Languagetool-cvs@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-cvs


[LanguageTool] SF.net SVN: languagetool:[7367] trunk/JLanguageTool

2012-06-16 Thread dnaber
Revision: 7367
  
http://languagetool.svn.sourceforge.net/languagetool/?rev=7367view=rev
Author:   dnaber
Date: 2012-06-16 16:22:02 + (Sat, 16 Jun 2012)
Log Message:
---
OOo/LO: fixed false alarm about word being uppercase when the previous sentence 
ended with a footnote - Sourceforge bug #3534637

Modified Paths:
--
trunk/JLanguageTool/CHANGES.txt
trunk/JLanguageTool/src/resource/segment.srx

trunk/JLanguageTool/src/test/org/languagetool/tokenizers/SRXSentenceTokenizerTest.java

Modified: trunk/JLanguageTool/CHANGES.txt
===
--- trunk/JLanguageTool/CHANGES.txt 2012-06-16 14:30:27 UTC (rev 7366)
+++ trunk/JLanguageTool/CHANGES.txt 2012-06-16 16:22:02 UTC (rev 7367)
@@ -65,6 +65,9 @@
 
  -LibreOffice / OpenOffice integration: SingletonFactory now implements
   XServiceInfo (Stephan Bergmann) - Sourceforge bug #3526635
+
+ -LibreOffice / OpenOffice integration: Fixed false alarm about word being
+  uppercase when the previous sentence ended with a footnote - Sourceforge bug 
#3534637
   
  -LanguageTool supports now separate rules for different local variants of a 
language,
   for example American English and British English. To use them from the 
command line, 

Modified: trunk/JLanguageTool/src/resource/segment.srx
===
--- trunk/JLanguageTool/src/resource/segment.srx2012-06-16 14:30:27 UTC 
(rev 7366)
+++ trunk/JLanguageTool/src/resource/segment.srx2012-06-16 16:22:02 UTC 
(rev 7367)
@@ -4504,7 +4504,7 @@
 /rule
 !-- Break rules --
 rule break=yes
-beforebreak[\.!?…]['||«|\)|\]|\}]?\s+/beforebreak
+beforebreak[\.!?…][\u0002|'||«|\)|\]|\}]?\s+/beforebreak
 afterbreak/afterbreak
 /rule
 rule break=yes
@@ -4657,7 +4657,7 @@
 /rule
 !-- Break rules --
 rule break=yes
-beforebreak[\.!?…]['||«|\)|\]|\}]?\s+/beforebreak
+beforebreak[\.!?…][\u0002|'||«|\)|\]|\}]?\s+/beforebreak
 afterbreak/afterbreak
 /rule
 rule break=yes
@@ -4685,7 +4685,7 @@
 /rule
 !-- Break rules --
 rule break=yes
-beforebreak[\.!?…]['||«|\)|\]|\}]?\s+/beforebreak
+beforebreak[\.!?…][\u0002|'||«|\)|\]|\}]?\s+/beforebreak
 afterbreak/afterbreak
 /rule
 rule break=yes

Modified: 
trunk/JLanguageTool/src/test/org/languagetool/tokenizers/SRXSentenceTokenizerTest.java
===
--- 
trunk/JLanguageTool/src/test/org/languagetool/tokenizers/SRXSentenceTokenizerTest.java
  2012-06-16 14:30:27 UTC (rev 7366)
+++ 
trunk/JLanguageTool/src/test/org/languagetool/tokenizers/SRXSentenceTokenizerTest.java
  2012-06-16 16:22:02 UTC (rev 7367)
@@ -19,6 +19,7 @@
 package org.languagetool.tokenizers;
 
 import junit.framework.TestCase;
+import org.languagetool.Language;
 import org.languagetool.TestTools;
 
 /**
@@ -30,7 +31,7 @@
   private SentenceTokenizer stokenizer = new SRXSentenceTokenizer(en);
   // accept only \n\n as paragraph:
   private SentenceTokenizer stokenizer2 = new SRXSentenceTokenizer(en);
-  
+
   public void setUp() {
 stokenizer.setSingleLineBreaksMarksParagraph(true);  
 stokenizer2.setSingleLineBreaksMarksParagraph(false);  
@@ -99,8 +100,26 @@
 testSplit(new String[] { It works [really!]. , No doubt. });
 testSplit(new String[] { It really(!) works well. });
 testSplit(new String[] { It really[!] works well. });
+
+testSplit(new String[] { This is a sentence.\u0002 , And this is 
another one. });  // footnotes in LibOO/OOo look like this
   }
 
+  public void testOfficeFootnoteTokenize() {
+for (Language language : Language.REAL_LANGUAGES) {
+  if (language.getSentenceTokenizer().getClass() != 
SRXSentenceTokenizer.class) {
+continue;
+  }
+  if (language == Language.KHMER || language == Language.MALAYALAM || 
language.getShortName().equals(pt)) {
+// TODO: I don't know about these...
+continue;
+  }
+  final String input = A sentence.\u0002 And another one.;
+  final SentenceTokenizer tokenizer = new 
SRXSentenceTokenizer(language.getShortName());
+  assertEquals(Sentence not split correctly for  + language + : ' + 
input + ',
+  [A sentence.\u0002 , And another one.], 
tokenizer.tokenize(input).toString());
+}
+  }
+
   private void testSplit(String[] sentences) {
 TestTools.testSplit(sentences, stokenizer);
   }

This was sent by the SourceForge.net collaborative development platform, the 
world's largest Open Source development site.


--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___

[LanguageTool] SF.net SVN: languagetool:[7368] trunk/JLanguageTool/src

2012-06-16 Thread dominikoeo
Revision: 7368
  
http://languagetool.svn.sourceforge.net/languagetool/?rev=7368view=rev
Author:   dominikoeo
Date: 2012-06-16 16:40:39 + (Sat, 16 Jun 2012)
Log Message:
---
[br] fixed disambiguation of the word o which caused
 false positive in:
 Me a rankfe chom d'o gortoz.

Modified Paths:
--
trunk/JLanguageTool/src/resource/br/disambiguation.xml
trunk/JLanguageTool/src/rules/br/grammar.xml

Modified: trunk/JLanguageTool/src/resource/br/disambiguation.xml
===
--- trunk/JLanguageTool/src/resource/br/disambiguation.xml  2012-06-16 
16:22:02 UTC (rev 7367)
+++ trunk/JLanguageTool/src/resource/br/disambiguation.xml  2012-06-16 
16:40:39 UTC (rev 7368)
@@ -123,6 +123,14 @@
 disambig postag=L a/
 /rule
 /rulegroup
+rule id=D_O name=d’o
+  pattern
+tokend’/token
+markertokeno/token/marker
+token postag=V inf/
+  /pattern
+  disambig postag=R e p 3 obj/
+/rule
 rulegroup id=EZ_AN name=ez an
 rule
 pattern

Modified: trunk/JLanguageTool/src/rules/br/grammar.xml
===
--- trunk/JLanguageTool/src/rules/br/grammar.xml2012-06-16 16:22:02 UTC 
(rev 7367)
+++ trunk/JLanguageTool/src/rules/br/grammar.xml2012-06-16 16:40:39 UTC 
(rev 7368)
@@ -1734,6 +1734,7 @@
 example type=correctPetra emaout o c’hortoz?/example
 example type=correctWarlene e ouiemp/example
 example type=correctPeogwir e selaouomp sonerezh hag e 
kanomp./example
+example type=correctMe a rankfe chom d’o gortoz./example
   /rule
   rule
 pattern

This was sent by the SourceForge.net collaborative development platform, the 
world's largest Open Source development site.


--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
Languagetool-cvs mailing list
Languagetool-cvs@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-cvs


[LanguageTool] SF.net SVN: languagetool:[7369] trunk/JLanguageTool/build.xml

2012-06-16 Thread dnaber
Revision: 7369
  
http://languagetool.svn.sourceforge.net/languagetool/?rev=7369view=rev
Author:   dnaber
Date: 2012-06-16 18:44:26 + (Sat, 16 Jun 2012)
Log Message:
---
exclude more hunspell files from the OOo/LO distribution

Modified Paths:
--
trunk/JLanguageTool/build.xml

Modified: trunk/JLanguageTool/build.xml
===
--- trunk/JLanguageTool/build.xml   2012-06-16 16:40:39 UTC (rev 7368)
+++ trunk/JLanguageTool/build.xml   2012-06-16 18:44:26 UTC (rev 7369)
@@ -469,7 +469,7 @@
 target name=dist depends=ooo-internal description=Builds 
LibreOffice/ApacheOpenOffice extension
 zip compress=yes destfile=${dist.dir}/LanguageTool-${version}.oxt 
basedir=${dist.dir} includes=** 

excludes=deploy.sh,undeploy.sh,turn.sh,printlocales.jar,**/hunspell/**,${bliki.lib},junit.jar,Rule*.jar,
-   
lucene*.jar,testwikipedia.*,LanguageTool.jar,LanguageToolGUI.jar,*.xsl,${jna.lib}
 /
+   
lucene*.jar,testwikipedia.*,LanguageTool.jar,LanguageToolGUI.jar,*.xsl,${jna.lib},hunspell-*.jnilib,hunspell-*.dll,libhunspell-*.so
 /
 /target
 
 target name=fast depends=ooo-internal description=like dist, but without 
ZIP compression (for development only)

This was sent by the SourceForge.net collaborative development platform, the 
world's largest Open Source development site.


--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
Languagetool-cvs mailing list
Languagetool-cvs@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-cvs


[LanguageTool] SF.net SVN: languagetool:[7370] trunk/JLanguageTool/src/rules/br/grammar. xml

2012-06-16 Thread dominikoeo
Revision: 7370
  
http://languagetool.svn.sourceforge.net/languagetool/?rev=7370view=rev
Author:   dominikoeo
Date: 2012-06-16 20:46:37 + (Sat, 16 Jun 2012)
Log Message:
---
[br] fixed false positive in Hor marc'h-houarn

Modified Paths:
--
trunk/JLanguageTool/src/rules/br/grammar.xml

Modified: trunk/JLanguageTool/src/rules/br/grammar.xml
===
--- trunk/JLanguageTool/src/rules/br/grammar.xml2012-06-16 18:44:26 UTC 
(rev 7369)
+++ trunk/JLanguageTool/src/rules/br/grammar.xml2012-06-16 20:46:37 UTC 
(rev 7370)
@@ -1181,7 +1181,7 @@
 pattern
   tokenhor/token
   marker
-token inflected=yes regexp=yesk.*|marc’hexception 
regexp=yesc’h.*/exception/token
+token inflected=yes regexp=yesk.*|marc’hexception 
regexp=yesc’h.*|marc’h.*/exception/token
   /marker
 /pattern
 messageKemmadur direizh. Goude ar ger «\1» e tro «\2» da 
suggestionmatch no=2 regexp_match=.(.*) 
regexp_replace=c’h$1//suggestion./message
@@ -1193,6 +1193,7 @@
 example type=incorrectHor markerkezeg/marker./example
 example type=incorrectHor markergezeg/marker./example
 example type=correctHor c’hezeg./example
+example type=correctHor marc’h-houarn./example
   /rule
   rule
 pattern

This was sent by the SourceForge.net collaborative development platform, the 
world's largest Open Source development site.


--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
Languagetool-cvs mailing list
Languagetool-cvs@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-cvs


[LanguageTool] SF.net SVN: languagetool:[7372] trunk/JLanguageTool/src/rules/br/grammar. xml

2012-06-16 Thread dominikoeo
Revision: 7372
  
http://languagetool.svn.sourceforge.net/languagetool/?rev=7372view=rev
Author:   dominikoeo
Date: 2012-06-16 21:11:13 + (Sat, 16 Jun 2012)
Log Message:
---
[br] - fixed typo in message *dra vlotaat* - dre vlotaat
 - added more references to the grammar from E. CHALM.

Modified Paths:
--
trunk/JLanguageTool/src/rules/br/grammar.xml

Modified: trunk/JLanguageTool/src/rules/br/grammar.xml
===
--- trunk/JLanguageTool/src/rules/br/grammar.xml2012-06-16 20:47:11 UTC 
(rev 7371)
+++ trunk/JLanguageTool/src/rules/br/grammar.xml2012-06-16 21:11:13 UTC 
(rev 7372)
@@ -712,7 +712,7 @@
 /token
   /marker
 /pattern
-messageUr c’hemmadur dre vlotaat a zlefe bezañ gant an anv-gwan «\2» 
goude «\1»./message
+messageUr c’hemmadur dre vlotaat a zlefe bezañ gant an anv-gwan «\2» 
goude «\1» (CHALM B3.1.2.4)./message
 example type=incorrectHemañ markertreut/marker/example
 example type=incorrectHemañ markerzreut/marker/example
 example type=correctHemañ dreut/example
@@ -731,7 +731,7 @@
 /token
   /marker
 /pattern
-messageUr c’hemmadur dre vlotaat diglok a zlefe bezañ gant an 
anv-gwan «\2» goude «\1»./message
+messageUr c’hemmadur dre vlotaat diglok a zlefe bezañ gant an 
anv-gwan «\2» goude «\1» (CHALM B3.1.2.3)./message
 example type=incorrectHennezh markerbihan/marker/example
 example type=incorrectHennezh markerpihan/marker/example
 example type=correctHennezh vihan/example
@@ -780,7 +780,7 @@
 /and
   /marker
 /pattern
-messageNe zlefe ket bezañ ur c’hemmadur goude ar ger «\1» gant ur 
ger a grog gant «k». Ha fellout a rae deoc’h skrivañ suggestionmatch no=2 
regexp_match=(g|c’h)(.*) regexp_replace=k$2//suggestion?/message
+messageNe zlefe ket bezañ ur c’hemmadur goude ar ger «\1» gant ur 
ger a grog gant «k». Ha fellout a rae deoc’h skrivañ suggestionmatch no=2 
regexp_match=(g|c’h)(.*) regexp_replace=k$2//suggestion (CHALM 
B3.3)?/message
 example type=incorrectHo markerc’harr/marker./example
 example type=incorrectHo markergarr/marker./example
 example type=correctHo markerkarr/marker./example
@@ -799,7 +799,7 @@
 /and
   /marker
 /pattern
-messageNe zlefe ket bezañ ur c’hemmadur goude ar ger «\1» gant ur 
ger a grog gant «t». Ha fellout a rae deoc’h skrivañ suggestionmatch no=2 
regexp_match=.(.*) regexp_replace=t$1//suggestion?/message
+messageNe zlefe ket bezañ ur c’hemmadur goude ar ger «\1» gant ur 
ger a grog gant «t». Ha fellout a rae deoc’h skrivañ suggestionmatch no=2 
regexp_match=.(.*) regexp_replace=t$1//suggestion (CHALM B3.3)?/message
 example type=incorrectHo markerdad/marker/example
 example type=incorrectHo markerzad/marker/example
 example type=correctHo markertad/marker/example
@@ -814,7 +814,7 @@
 /and
   /marker
 /pattern
-messageNe zlefe ket bezañ ur c’hemmadur goude ar ger «\1» gant ur 
ger a grog gant «t». Ha fellout a rae deoc’h skrivañ suggestionmatch no=2 
regexp_match=.(.*) regexp_replace=p$1//suggestion?/message
+messageNe zlefe ket bezañ ur c’hemmadur goude ar ger «\1» gant ur 
ger a grog gant «t». Ha fellout a rae deoc’h skrivañ suggestionmatch no=2 
regexp_match=.(.*) regexp_replace=p$1//suggestion (CHALM 
B.3.3)?/message
 example type=incorrectHo markerbenn/marker/example
 example type=incorrectHo markerfenn/marker/example
 example type=correctHo markerpenn/marker/example
@@ -827,7 +827,7 @@
 token regexp=yesg.*/token
   /marker
 /pattern
-messageUr c’hemmadur dre galetaat a zlefe bezañ goude ar ger «\1». 
Ha fellout a rae deoc’h skrivañ suggestionmatch no=2 regexp_match=.(.*) 
regexp_replace=k$1//suggestion?/message
+messageUr c’hemmadur dre galetaat a zlefe bezañ goude ar ger «\1». 
Ha fellout a rae deoc’h skrivañ suggestionmatch no=2 regexp_match=.(.*) 
regexp_replace=k$1//suggestion (CHALM B3.3)?/message
 example type=incorrectMat eo ho 
markergwastell/marker./example
 example type=correctMat eo ho kwastell./example
 example type=incorrectLous eo ho markergenoù/marker./example
@@ -840,7 +840,7 @@
 token regexp=yes postag=[^M]*M.*:1:.* 
postag_regexp=yesw.*/token
   /marker
 /pattern
-messageUr c’hemmadur dre galetaat a zlefe bezañ goude ar ger «\1». 
Ha fellout a rae deoc’h skrivañ suggestionk\2/suggestion?/message
+messageUr c’hemmadur dre galetaat a zlefe bezañ goude ar ger «\1». 
Ha fellout a rae deoc’h skrivañ suggestionk\2/suggestion (CHALM 
B3.3)?/message
 example type=incorrectHo markerwazh/marker./example
 example type=correctHo kwazh./example
   /rule
@@ -854,7 +854,7 

[LanguageTool] SF.net SVN: languagetool:[7373] trunk/JLanguageTool/src/rules/br/grammar. xml

2012-06-16 Thread dominikoeo
Revision: 7373
  
http://languagetool.svn.sourceforge.net/languagetool/?rev=7373view=rev
Author:   dominikoeo
Date: 2012-06-16 21:33:39 + (Sat, 16 Jun 2012)
Log Message:
---
[br] fixed a few false positives.

Modified Paths:
--
trunk/JLanguageTool/src/rules/br/grammar.xml

Modified: trunk/JLanguageTool/src/rules/br/grammar.xml
===
--- trunk/JLanguageTool/src/rules/br/grammar.xml2012-06-16 21:11:13 UTC 
(rev 7372)
+++ trunk/JLanguageTool/src/rules/br/grammar.xml2012-06-16 21:33:39 UTC 
(rev 7373)
@@ -399,7 +399,7 @@
 and
   token regexp=yes inflected=yes postag=UNKNOWN 
negate_pos=yesd.*
 exception inflected=yesda/exception
-exception regexp=yes inflected=yest.*/exception
+exception regexp=yes 
inflected=yest.*|dirak|dindan/exception
 exception regexp=yes[dt]ud(-(mañ|se|hont))?/exception
   /token
   !-- La mutation est faculative dans le verbe dleout d’après
@@ -420,6 +420,7 @@
 example type=correctAn ti a markerzle/marker bezañ 
amañ./example
 example type=incorrectPe markerdeiz/marker e teuy?/example
 example type=correctPe zeiz e teuy?/example
+example type=correctAdsavit ar frazennoù-mañ en ur lakaat re 
markerdirak/marker an anvioù-gwan./example
   /rule
   rule
 pattern
@@ -1054,7 +1055,9 @@
   /token
   marker
 and
-  token regexp=yes inflected=yesd.*/token
+  token regexp=yes inflected=yesd.*
+exception regexp=yes inflected=yes[zt].*/exception
+  /token
   token regexp=yes[tz].*exception 
regexp=yes[tz]ud/exception/token
 /and
   /marker
@@ -1064,6 +1067,7 @@
 example type=incorrectTeir markerzimezell/marker/example
 example type=correctTeir markerdimezell/marker/example
 example type=correctGant o zud/example
+example type=correctTri zu/example
   /rule
   rule
 pattern
@@ -1949,7 +1953,7 @@
 marker
   token postag=[^M]*M:.* postag_regexp=yes
 exception negate_pos=yes postag=[^M]*M:.*|SENT_END 
postag_regexp=yes/
-exceptionvat/exception
+exception regexp=yesva[dt]/exception
   /token
 /marker
   /pattern

This was sent by the SourceForge.net collaborative development platform, the 
world's largest Open Source development site.


--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
Languagetool-cvs mailing list
Languagetool-cvs@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-cvs