[Nutch Wiki] Update of "HttpAuthenticationSchemes" by susam

2009-03-31 Thread Apache Wiki
Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification. The following page has been changed by susam: http://wiki.apache.org/nutch/HttpAuthenticationSchemes

Nutch Topical / Focused Crawl

2009-03-31 Thread - -
Hi all, I'd like to turn Nutch into a focused / topical crawler. It's part of my final-year thesis. Further, I'd like others to be able to benefit from my work. I started to analyze the code and think that I found the right piece of code. I just wanted to know if I am on the right track. I thin

[jira] Updated: (NUTCH-578) URL fetched with 403 is generated over and over again

2009-03-31 Thread Dmitry Lihachev (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-578?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dmitry Lihachev updated NUTCH-578: Attachment: NUTCH-578_v3.patch. Changes in CrawlDbReducer already applied in trunk, so patch only

Where to find Lucene Source code??

2009-03-31 Thread Sherjeel Niazi
Hi there, I have configured Nutch 0.9 in Eclipse following the tutorial (http://wiki.apache.org/nutch/RunNutchInEclipse0.9). What I would like to do is include the Lucene source code instead of its jar file. Where can I download the Lucene source code? Thanks, Sherjeel.