[jira] [Commented] (NUTCH-1031) Delegate parsing of robots.txt to crawler-commons

2013-05-09 Thread Tejas Patil (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13652805#comment-13652805 ] Tejas Patil commented on NUTCH-1031: I had forgot to add crawler-commons dependency in

[jira] [Commented] (NUTCH-1031) Delegate parsing of robots.txt to crawler-commons

2013-05-09 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13653012#comment-13653012 ] Lewis John McGibbney commented on NUTCH-1031: - Hi Tejas, A quick note on

[jira] [Commented] (NUTCH-1031) Delegate parsing of robots.txt to crawler-commons

2013-04-29 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13644754#comment-13644754 ] Lewis John McGibbney commented on NUTCH-1031: - +1 from me Tejas. Unit tests

[jira] [Commented] (NUTCH-1031) Delegate parsing of robots.txt to crawler-commons

2013-04-29 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13644868#comment-13644868 ] Hudson commented on NUTCH-1031: --- Integrated in Nutch-nutchgora #587 (See

[jira] [Commented] (NUTCH-1031) Delegate parsing of robots.txt to crawler-commons

2013-04-05 Thread Tejas Patil (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13624194#comment-13624194 ] Tejas Patil commented on NUTCH-1031: I have removed the @author tag and ported the

[jira] [Commented] (NUTCH-1031) Delegate parsing of robots.txt to crawler-commons

2013-04-05 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13624291#comment-13624291 ] Hudson commented on NUTCH-1031: --- Integrated in Nutch-trunk #2156 (See

[jira] [Commented] (NUTCH-1031) Delegate parsing of robots.txt to crawler-commons

2013-03-15 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13603406#comment-13603406 ] Sebastian Nagel commented on NUTCH-1031: +1 (nothing to complain) P.S.: see

[jira] [Commented] (NUTCH-1031) Delegate parsing of robots.txt to crawler-commons

2013-03-15 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13603482#comment-13603482 ] Sebastian Nagel commented on NUTCH-1031: There are differences between trunk and

[jira] [Commented] (NUTCH-1031) Delegate parsing of robots.txt to crawler-commons

2013-03-08 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13597515#comment-13597515 ] Lewis John McGibbney commented on NUTCH-1031: - Hi Tejas. Sorry for taking

[jira] [Commented] (NUTCH-1031) Delegate parsing of robots.txt to crawler-commons

2013-03-05 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13594391#comment-13594391 ] Lewis John McGibbney commented on NUTCH-1031: - MHi Tejas. If you go to search

[jira] [Commented] (NUTCH-1031) Delegate parsing of robots.txt to crawler-commons

2013-02-25 Thread lufeng (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13585730#comment-13585730 ] lufeng commented on NUTCH-1031: --- Hi Tejas 1. The EmptyRobotRules class is not delete in

[jira] [Commented] (NUTCH-1031) Delegate parsing of robots.txt to crawler-commons

2013-02-24 Thread Tejas Patil (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13585467#comment-13585467 ] Tejas Patil commented on NUTCH-1031: Hi Sebastian, Thanks for your time and suggesting

[jira] [Commented] (NUTCH-1031) Delegate parsing of robots.txt to crawler-commons

2013-02-22 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13584824#comment-13584824 ] Sebastian Nagel commented on NUTCH-1031: Hi Tejas, a test of

[jira] [Commented] (NUTCH-1031) Delegate parsing of robots.txt to crawler-commons

2013-02-21 Thread Tejas Patil (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13583013#comment-13583013 ] Tejas Patil commented on NUTCH-1031: Hey Ken, A gentle reminder for releasing CC.

[jira] [Commented] (NUTCH-1031) Delegate parsing of robots.txt to crawler-commons

2013-02-21 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13583340#comment-13583340 ] Lewis John McGibbney commented on NUTCH-1031: - Hi Tejas. We released it ;)

[jira] [Commented] (NUTCH-1031) Delegate parsing of robots.txt to crawler-commons

2013-02-21 Thread Tejas Patil (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13583662#comment-13583662 ] Tejas Patil commented on NUTCH-1031: Hi Lewis, I should have checked on the main page

[jira] [Commented] (NUTCH-1031) Delegate parsing of robots.txt to crawler-commons

2013-02-21 Thread Tejas Patil (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13583664#comment-13583664 ] Tejas Patil commented on NUTCH-1031: @Dev: I am planning to commit this change in

[jira] [Commented] (NUTCH-1031) Delegate parsing of robots.txt to crawler-commons

2013-01-23 Thread Ken Krugler (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13560877#comment-13560877 ] Ken Krugler commented on NUTCH-1031: I've rolled this into trunk at crawler-commons.

[jira] [Commented] (NUTCH-1031) Delegate parsing of robots.txt to crawler-commons

2013-01-22 Thread Ken Krugler (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13560420#comment-13560420 ] Ken Krugler commented on NUTCH-1031: Hi Tejas, I've been on the road, but I'll check

[jira] [Commented] (NUTCH-1031) Delegate parsing of robots.txt to crawler-commons

2013-01-20 Thread Ken Krugler (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13558340#comment-13558340 ] Ken Krugler commented on NUTCH-1031: Hi Tejas - I've looked at your patch, and

[jira] [Commented] (NUTCH-1031) Delegate parsing of robots.txt to crawler-commons

2013-01-20 Thread Tejas Patil (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13558349#comment-13558349 ] Tejas Patil commented on NUTCH-1031: Hi Ken, Thanks for reviewing the patch. I will

[jira] [Commented] (NUTCH-1031) Delegate parsing of robots.txt to crawler-commons

2013-01-19 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13558050#comment-13558050 ] Lewis John McGibbney commented on NUTCH-1031: - Is the issue with multiple

[jira] [Commented] (NUTCH-1031) Delegate parsing of robots.txt to crawler-commons

2013-01-19 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13558195#comment-13558195 ] Julien Nioche commented on NUTCH-1031: -- bq. 1. Continue to have the legacy code for

[jira] [Commented] (NUTCH-1031) Delegate parsing of robots.txt to crawler-commons

2013-01-18 Thread Tejas Patil (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13557930#comment-13557930 ] Tejas Patil commented on NUTCH-1031: After waiting for more than a week, I think that

[jira] [Commented] (NUTCH-1031) Delegate parsing of robots.txt to crawler-commons

2013-01-07 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13545958#comment-13545958 ] Julien Nioche commented on NUTCH-1031: -- well we have 2 separate params :

[jira] [Commented] (NUTCH-1031) Delegate parsing of robots.txt to crawler-commons

2013-01-07 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13545989#comment-13545989 ] Markus Jelsma commented on NUTCH-1031: -- I think it would be a _very_ good thing to

[jira] [Commented] (NUTCH-1031) Delegate parsing of robots.txt to crawler-commons

2013-01-07 Thread Tejas Patil (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13546639#comment-13546639 ] Tejas Patil commented on NUTCH-1031: The current nutch robots parsing logic is uses

[jira] [Commented] (NUTCH-1031) Delegate parsing of robots.txt to crawler-commons

2012-06-21 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13398338#comment-13398338 ] Lewis John McGibbney commented on NUTCH-1031: - crawler-commons is available

[jira] [Commented] (NUTCH-1031) Delegate parsing of robots.txt to crawler-commons

2012-06-21 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13398340#comment-13398340 ] Julien Nioche commented on NUTCH-1031: -- crawler-commons is not super active and I

[jira] [Commented] (NUTCH-1031) Delegate parsing of robots.txt to crawler-commons

2012-01-12 Thread Lewis John McGibbney (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13185285#comment-13185285 ] Lewis John McGibbney commented on NUTCH-1031: - Hi Julien, out of shear