Hi All,

I've only recently discovered Lucene / Nutch and I'm extremely impressed with 
its indexing ability and production of relevant results.

Can anyone give me any guidance or point me at documents that could help me 
index password-protected content?

I have a web site, www.sheerpoetry.co.uk, which is run by four UK poets whose 
work is studied by children for school / college exams in the UK.  We'd like to 
use Nutch to index and display search results (by customising the Nutch demo 
web-app).  Our problem is that the web site content is protected by a Tomcat 
Security Realm, so only registered, logged-in users can view the content.  
This means that the Nutch crawler will somehow have to log in to index the 
content.  If anyone has any suggestions on how to do this, I'd be most grateful 
to hear them.

Kind Regards,
Chris


_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general
