During a crawl of about 3.8M tlds to a depth of 2, when I try to index the 
segments, I get the following error:

java.lang.StackOverflowError
        at java.util.regex.Pattern$Loop.match(Pattern.java:4295)
Any help with this error would be much appreciated, I have encountered this 
before. 

here is the last 10 lines of the hadoop.log file:

tail -n 10 hadoop.log.2010-01-10
        at java.util.regex.Pattern$GroupTail.match(Pattern.java:4227)
        at java.util.regex.Pattern$BranchConn.match(Pattern.java:4078)
        at java.util.regex.Pattern$Ques.match(Pattern.java:3691)
        at java.util.regex.Pattern$Branch.match(Pattern.java:4114)
        at java.util.regex.Pattern$GroupHead.match(Pattern.java:4168)
        at java.util.regex.Pattern$Loop.match(Pattern.java:4295)
        at java.util.regex.Pattern$GroupTail.match(Pattern.java:4227)
        at java.util.regex.Pattern$BranchConn.match(Pattern.java:4078)
        at java.util.regex.Pattern$Ques.match(Pattern.java:3691)
2010-01-11 00:31:53,221 WARN  io.UTF8 - truncating long string: 62492 chars, 
starting with java.lang.StackOverf



Eric Osgood
---------------------------------------------
Cal Poly - Computer Engineering, Moon Valley Software
---------------------------------------------
eosg...@calpoly.edu, e...@lakemeadonline.com
---------------------------------------------
www.calpoly.edu/~eosgood, www.lakemeadonline.com

Reply via email to