[ 
https://issues.apache.org/jira/browse/LABS-190?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12638988#action_12638988
 ] 

Javier Puerto commented on LABS-190:
------------------------------------

>Thorsten Scherler - 12/Oct/08 01:12 PM
>I reviewed your patch but I do not get:
>Why should parseRobots be cofigurable?

If you want to make a droid that read the robots.txt or no to the parser. I 
make configurable because the code before was fixed to true, i think that it 
must be false, but with possibility to change. There's no much sense to let 
parse this file.


> concurrency error in NoRobotClient
> ----------------------------------
>
>                 Key: LABS-190
>                 URL: https://issues.apache.org/jira/browse/LABS-190
>             Project: Labs
>          Issue Type: Bug
>          Components: Droids
>         Environment: Ubuntu 8.04, JDK 1.6
>            Reporter: Javier Puerto
>            Priority: Blocker
>         Attachments: NoRobotClient.diff
>
>
> Testing with droids, when the number of workers rise the NoRobotClient thows 
> an exception. I was searching for the error without sucess but it seems to be 
> caused by the concurrency with the base url.
> This is the error:
> pool-1-thread-3: Starting org.apache.droids.crawler.CrawlingWorker
> java.lang.StringIndexOutOfBoundsException: String index out of range: -1
>       at java.lang.String.substring(String.java:1938)
>       at java.lang.String.substring(String.java:1905)
>       at 
> org.apache.http.norobots.NoRobotClient.isUrlAllowed(NoRobotClient.java:202)
>       at org.apache.droids.protocol.http.Http.isAllowed(Http.java:87)
>       at 
> org.apache.droids.crawler.CrawlingWorker.execute(CrawlingWorker.java:49)
>       at 
> org.apache.droids.crawler.CrawlingWorker.execute(CrawlingWorker.java:1)
>       at 
> org.apache.droids.impl.MultiThreadedTaskMaster$WorkerRunner.run(MultiThreadedTaskMaster.java:186)
>       at 
> java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:885)
>       at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:907)
>       at java.lang.Thread.run(Thread.java:619)
> pool-1-thread-1: Worker "76" has finished.
> pool-1-thread-2: Url is allowed
> java.lang.StringIndexOutOfBoundsException: String index out of range: -2
>       at java.lang.String.substring(String.java:1938)
>       at java.lang.String.substring(String.java:1905)
>       at 
> org.apache.http.norobots.NoRobotClient.isUrlAllowed(NoRobotClient.java:202)
>       at org.apache.droids.protocol.http.Http.isAllowed(Http.java:87)
>       at 
> org.apache.droids.crawler.CrawlingWorker.execute(CrawlingWorker.java:49)
>       at 
> org.apache.droids.crawler.CrawlingWorker.execute(CrawlingWorker.java:1)
>       at 
> org.apache.droids.impl.MultiThreadedTaskMaster$WorkerRunner.run(MultiThreadedTaskMaster.java:186)
>       at 
> java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:885)
>       at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:907)
>       at java.lang.Thread.run(Thread.java:619)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to