[jira] Updated: (NUTCH-44) too many search results

2007-09-08 Thread Susam Pal (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-44?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Susam Pal updated NUTCH-44: --- Attachment: NUTCH-44.patch Attached a patch. To apply:- patch -p0 < NUTCH-44.patch ant war cp

[jira] Updated: (NUTCH-44) too many search results

2007-09-08 Thread Susam Pal (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-44?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Susam Pal updated NUTCH-44: --- Attachment: (was: NUTCH-44.patch) too many search results --- Key:

[jira] Updated: (NUTCH-44) too many search results

2007-09-08 Thread Susam Pal (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-44?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Susam Pal updated NUTCH-44: --- Attachment: NUTCH-44.patch Updated my previous patch to fix the issue in opensearch too. To apply:- patch

[jira] Updated: (NUTCH-281) cached.jsp: base-href needs to be outside comments

2007-09-09 Thread Susam Pal (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Susam Pal updated NUTCH-281: Attachment: NUTCH-281.patch Uploading a patch. Put the base tag outside comments and now the relative

[jira] Created: (NUTCH-557) protocol-http11 for HTTP 1.1, HTTPS, NTLM, Basic and Digest Authentication

2007-09-18 Thread Susam Pal (JIRA)
protocol-http11 for HTTP 1.1, HTTPS, NTLM, Basic and Digest Authentication -- Key: NUTCH-557 URL: https://issues.apache.org/jira/browse/NUTCH-557 Project: Nutch Issue

[jira] Updated: (NUTCH-557) protocol-http11 for HTTP 1.1, HTTPS, NTLM, Basic and Digest Authentication

2007-09-18 Thread Susam Pal (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-557?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Susam Pal updated NUTCH-557: Attachment: protocol-http11v0.1.patch I have generated this patch against Nutch trunk. To apply:- patch

[jira] Updated: (NUTCH-557) protocol-http11 for HTTP 1.1, HTTPS, NTLM, Basic and Digest Authentication

2007-09-18 Thread Susam Pal (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-557?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Susam Pal updated NUTCH-557: Priority: Minor (was: Major) protocol-http11 for HTTP 1.1, HTTPS, NTLM, Basic and Digest Authentication

[jira] Commented: (NUTCH-557) protocol-http11 for HTTP 1.1, HTTPS, NTLM, Basic and Digest Authentication

2007-09-19 Thread Susam Pal (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-557?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12528854 ] Susam Pal commented on NUTCH-557: - No, there isn't any significant difference in performance. Here's a list of the

[jira] Created: (NUTCH-559) NTLM, Basic and Digest Authentication schemes for web/proxy server

2007-09-24 Thread Susam Pal (JIRA)
NTLM, Basic and Digest Authentication schemes for web/proxy server -- Key: NUTCH-559 URL: https://issues.apache.org/jira/browse/NUTCH-559 Project: Nutch Issue Type: Improvement

[jira] Updated: (NUTCH-559) NTLM, Basic and Digest Authentication schemes for web/proxy server

2007-09-24 Thread Susam Pal (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Susam Pal updated NUTCH-559: Attachment: NUTCH-559v0.1.patch I have generated this patch against Nutch trunk. It will add support for

[jira] Closed: (NUTCH-557) protocol-http11 for HTTP 1.1, HTTPS, NTLM, Basic and Digest Authentication

2007-09-24 Thread Susam Pal (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-557?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Susam Pal closed NUTCH-557. --- Resolution: Won't Fix As per the discussion, 'protocol-http11' has been turned into a patch for

[jira] Issue Comment Edited: (NUTCH-539) HttpClient plugin does not work with BasicAuthentication

2007-09-25 Thread Susam Pal (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-539?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12530175 ] susam edited comment on NUTCH-539 at 9/25/07 10:54 AM: --- 1. There is a bug in the patch. The

[jira] Updated: (NUTCH-559) NTLM, Basic and Digest Authentication schemes for web/proxy server

2007-09-25 Thread Susam Pal (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Susam Pal updated NUTCH-559: Priority: Major (was: Minor) Apart from adding the authentication features, this patch would fix three

[jira] Commented: (NUTCH-560) protocol-httpclient reading more bytes than http.content.limit

2007-09-26 Thread Susam Pal (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-560?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12530519 ] Susam Pal commented on NUTCH-560: - I analysed 'protocol-http' and it behaves almost in the same manner. While

[jira] Updated: (NUTCH-559) NTLM, Basic and Digest Authentication schemes for web/proxy server

2007-11-01 Thread Susam Pal (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Susam Pal updated NUTCH-559: Attachment: NUTCH-559v0.4.patch Uploading a revised (v0.4) patch that has all authentication configuration

[jira] Updated: (NUTCH-559) NTLM, Basic and Digest Authentication schemes for web/proxy server

2007-11-28 Thread Susam Pal (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Susam Pal updated NUTCH-559: Attachment: NUTCH-559v0.5.patch Uploading a revised (v0.5) patch with some test cases. Added a 'scheme'

[jira] Updated: (NUTCH-601) Recrawling on existing crawl directory using force option

2008-02-04 Thread Susam Pal (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Susam Pal updated NUTCH-601: Attachment: NUTCH-601v0.2.patch Attached a revised patch (NUTCH-601v0.2.patch), which removes the old

[jira] Updated: (NUTCH-601) Recrawling on existing crawl directory using force option

2008-02-04 Thread Susam Pal (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Susam Pal updated NUTCH-601: Attachment: NUTCH-601v0.1.patch Patch attached. Recrawling on existing crawl directory using force option

[jira] Created: (NUTCH-601) Recrawling on existing crawl directory using force option

2008-02-04 Thread Susam Pal (JIRA)
Recrawling on existing crawl directory using force option - Key: NUTCH-601 URL: https://issues.apache.org/jira/browse/NUTCH-601 Project: Nutch Issue Type: Improvement Affects

[jira] Commented: (NUTCH-601) Recrawling on existing crawl directory using force option

2008-02-05 Thread Susam Pal (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-601?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12565848#action_12565848 ] Susam Pal commented on NUTCH-601: - The 'if (newIndex != index)' condition is just a check

[jira] Updated: (NUTCH-601) Recrawling on existing crawl directory using force option

2008-02-15 Thread Susam Pal (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Susam Pal updated NUTCH-601: Attachment: NUTCH-601v1.0.patch Attached another patch (NUTCH-601v1.0.patch) that always deletes the old

[jira] Updated: (NUTCH-601) Recrawling on existing crawl directory using force option

2008-02-15 Thread Susam Pal (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Susam Pal updated NUTCH-601: Attachment: NUTCH-601v0.3.patch Attached a revised patch (NUTCH-601v0.3.patch) that makes the code simpler

[jira] Updated: (NUTCH-612) URL filtering is always disabled in Generator when invoked by Crawl

2008-02-15 Thread Susam Pal (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-612?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Susam Pal updated NUTCH-612: Attachment: NUTCH-612v0.1.patch Attached patch to fix the bug. This modifies Crawl.java and Generator.java.

[jira] Created: (NUTCH-612) URL filtering is always disabled in Generator when invoked by Crawl

2008-02-15 Thread Susam Pal (JIRA)
URL filtering is always disabled in Generator when invoked by Crawl --- Key: NUTCH-612 URL: https://issues.apache.org/jira/browse/NUTCH-612 Project: Nutch Issue Type: Bug

[jira] Created: (NUTCH-735) crawl-tool.xml must be read before nutch-site.xml when invoked using crawl command

2009-05-09 Thread Susam Pal (JIRA)
crawl-tool.xml must be read before nutch-site.xml when invoked using crawl command -- Key: NUTCH-735 URL: https://issues.apache.org/jira/browse/NUTCH-735 Project: Nutch

[jira] Updated: (NUTCH-735) crawl-tool.xml must be read before nutch-site.xml when invoked using crawl command

2009-05-09 Thread Susam Pal (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-735?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Susam Pal updated NUTCH-735: Attachment: NUTCH-735v0.1.patch Attached patch. crawl-tool.xml must be read before nutch-site.xml when