Hi. I am currently using languagetool-commandline checking billions of paragraphs.
It works fine, except for some dumps, like the ones below. I need the tool to continue, for I need the data. When it has been processed, I might try to find the items it crashes on. It looks like it is all string things. Could it crash in utf8 encoding errors? Ruud java.util.concurrent.ExecutionException: java.lang.StringIndexOutOfBoundsException: String index out of range: 0 at org.languagetool.MultiThreadedJLanguageTool.performCheck(MultiThreadedJLanguageTool.java:101) at org.languagetool.JLanguageTool.check(JLanguageTool.java:576) at org.languagetool.JLanguageTool.check(JLanguageTool.java:534) at org.languagetool.JLanguageTool.check(JLanguageTool.java:530) at org.languagetool.commandline.CommandLineTools.checkText(CommandLineTools.java:96) at org.languagetool.commandline.Main.handleLine(Main.java:386) at org.languagetool.commandline.Main.runOnFileLineByLine(Main.java:286) at org.languagetool.commandline.Main.runOnFile(Main.java:166) at org.languagetool.commandline.Main.main(Main.java:519) Caused by: java.util.concurrent.ExecutionException: java.lang.StringIndexOutOfBoundsException: String index out of range: 0 at java.util.concurrent.FutureTask.report(FutureTask.java:122) at java.util.concurrent.FutureTask.get(FutureTask.java:188) at org.languagetool.MultiThreadedJLanguageTool.performCheck(MultiThreadedJLanguageTool.java:98) ... 8 more Caused by: java.lang.StringIndexOutOfBoundsException: String index out of range: 0 at java.lang.String.charAt(String.java:658) at org.languagetool.rules.CommaWhitespaceRule.isNumberOrDot(CommaWhitespaceRule.java:130) at org.languagetool.rules.CommaWhitespaceRule.match(CommaWhitespaceRule.java:92) at org.languagetool.JLanguageTool.checkAnalyzedSentence(JLanguageTool.java:686) at org.languagetool.JLanguageTool$TextCheckCallable.call(JLanguageTool.java:995) at org.languagetool.JLanguageTool$TextCheckCallable.call(JLanguageTool.java:962) at java.util.concurrent.FutureTask.run(FutureTask.java:262) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) at java.lang.Thread.run(Thread.java:744) Exception in thread "main" java.lang.RuntimeException: java.util.concurrent.ExecutionException: java.lang.StringIndexOutOfBoundsException: String index out of range: 0 at org.languagetool.MultiThreadedJLanguageTool.performCheck(MultiThreadedJLanguageTool.java:101) at org.languagetool.JLanguageTool.check(JLanguageTool.java:576) at org.languagetool.JLanguageTool.check(JLanguageTool.java:534) at org.languagetool.JLanguageTool.check(JLanguageTool.java:530) at org.languagetool.commandline.CommandLineTools.checkText(CommandLineTools.java:96) at org.languagetool.commandline.Main.handleLine(Main.java:386) at org.languagetool.commandline.Main.runOnFileLineByLine(Main.java:286) at org.languagetool.commandline.Main.runOnFile(Main.java:166) at org.languagetool.commandline.Main.main(Main.java:519) Caused by: java.util.concurrent.ExecutionException: java.lang.StringIndexOutOfBoundsException: String index out of range: 0 at java.util.concurrent.FutureTask.report(FutureTask.java:122) at java.util.concurrent.FutureTask.get(FutureTask.java:188) at org.languagetool.MultiThreadedJLanguageTool.performCheck(MultiThreadedJLanguageTool.java:98) ... 8 more Caused by: java.lang.StringIndexOutOfBoundsException: String index out of range: 0 at java.lang.String.charAt(String.java:658) at org.languagetool.rules.CommaWhitespaceRule.isNumberOrDot(CommaWhitespaceRule.java:130) at org.languagetool.rules.CommaWhitespaceRule.match(CommaWhitespaceRule.java:92) at org.languagetool.JLanguageTool.checkAnalyzedSentence(JLanguageTool.java:686) at org.languagetool.JLanguageTool$TextCheckCallable.call(JLanguageTool.java:995) at org.languagetool.JLanguageTool$TextCheckCallable.call(JLanguageTool.java:962) at java.util.concurrent.FutureTask.run(FutureTask.java:262) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) at java.lang.Thread.run(Thread.java:744) Exception in thread "main" java.lang.RuntimeException: java.util.concurrent.ExecutionException: java.lang.StringIndexOutOfBoundsException: String index out of range: 0 at org.languagetool.MultiThreadedJLanguageTool.performCheck(MultiThreadedJLanguageTool.java:101) at org.languagetool.JLanguageTool.check(JLanguageTool.java:576) at org.languagetool.JLanguageTool.check(JLanguageTool.java:534) at org.languagetool.JLanguageTool.check(JLanguageTool.java:530) at org.languagetool.commandline.CommandLineTools.checkText(CommandLineTools.java:96) at org.languagetool.commandline.Main.handleLine(Main.java:386) at org.languagetool.commandline.Main.runOnFileLineByLine(Main.java:286) at org.languagetool.commandline.Main.runOnFile(Main.java:166) at org.languagetool.commandline.Main.main(Main.java:519) Caused by: java.util.concurrent.ExecutionException: java.lang.StringIndexOutOfBoundsException: String index out of range: 0 at java.util.concurrent.FutureTask.report(FutureTask.java:122) at java.util.concurrent.FutureTask.get(FutureTask.java:188) at org.languagetool.MultiThreadedJLanguageTool.performCheck(MultiThreadedJLanguageTool.java:98) ... 8 more Caused by: java.lang.StringIndexOutOfBoundsException: String index out of range: 0 at java.lang.String.charAt(String.java:658) at org.languagetool.rules.CommaWhitespaceRule.isNumberOrDot(CommaWhitespaceRule.java:130) at org.languagetool.rules.CommaWhitespaceRule.match(CommaWhitespaceRule.java:92) at org.languagetool.JLanguageTool.checkAnalyzedSentence(JLanguageTool.java:686) at org.languagetool.JLanguageTool$TextCheckCallable.call(JLanguageTool.java:995) at org.languagetool.JLanguageTool$TextCheckCallable.call(JLanguageTool.java:962) at java.util.concurrent.FutureTask.run(FutureTask.java:262) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) at java.lang.Thread.run(Thread.java:744) Exception in thread "main" java.lang.RuntimeException: java.util.concurrent.ExecutionException: java.lang.StringIndexOutOfBoundsException: String index out of range: 0 at org.languagetool.MultiThreadedJLanguageTool.performCheck(MultiThreadedJLanguageTool.java:101) at org.languagetool.JLanguageTool.check(JLanguageTool.java:576) at org.languagetool.JLanguageTool.check(JLanguageTool.java:534) at org.languagetool.JLanguageTool.check(JLanguageTool.java:530) at org.languagetool.commandline.CommandLineTools.checkText(CommandLineTools.java:96) at org.languagetool.commandline.Main.handleLine(Main.java:386) at org.languagetool.commandline.Main.runOnFileLineByLine(Main.java:286) at org.languagetool.commandline.Main.runOnFile(Main.java:166) at org.languagetool.commandline.Main.main(Main.java:519) Caused by: java.util.concurrent.ExecutionException: java.lang.StringIndexOutOfBoundsException: String index out of range: 0 at java.util.concurrent.FutureTask.report(FutureTask.java:122) at java.util.concurrent.FutureTask.get(FutureTask.java:188) at org.languagetool.MultiThreadedJLanguageTool.performCheck(MultiThreadedJLanguageTool.java:98) ... 8 more Caused by: java.lang.StringIndexOutOfBoundsException: String index out of range: 0 at java.lang.String.charAt(String.java:658) at org.languagetool.rules.CommaWhitespaceRule.isNumberOrDot(CommaWhitespaceRule.java:130) at org.languagetool.rules.CommaWhitespaceRule.match(CommaWhitespaceRule.java:92) at org.languagetool.JLanguageTool.checkAnalyzedSentence(JLanguageTool.java:686) at org.languagetool.JLanguageTool$TextCheckCallable.call(JLanguageTool.java:995) at org.languagetool.JLanguageTool$TextCheckCallable.call(JLanguageTool.java:962) at java.util.concurrent.FutureTask.run(FutureTask.java:262) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) at java.lang.Thread.run(Thread.java:744) ------------------------------------------------------------------------------ The best possible search technologies are now affordable for all companies. Download your FREE open source Enterprise Search Engine today! Our experts will assist you in its installation for $59/mo, no commitment. Test it for FREE on our Cloud platform anytime! http://pubads.g.doubleclick.net/gampad/clk?id=145328191&iu=/4140/ostg.clktrk _______________________________________________ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel