Hi Peter, I read your tutorial on installing Nutch. I installed it and everything works great, but I have one big question.
When I run the crawler, the url directory contains a .txt file with this line:

http://www.opentechlearning.com/

and the 'crawl-urlfilter' file inside the 'conf' folder has to contain:

+^http://([a-z0-9]*\.)*opentechlearning.com/

My question is how to add the following pages to the crawl-urlfilter file:

http://cnx.org/lenses/ccotp/endorsements/atom
http://ocw.nd.edu/courselist/rss
http://openlearn.open.ac.uk/file.php/1/learningspace.xml
....

These do not start with www., and that is causing me problems. For example, for http://cnx.org/lenses/ccotp/endorsements/atom I tried both

+^http://([a-z0-9]*\.)*cnx.org/lenses/ccotp/endorsements/atom

and

+^http://cnx.org/lenses/ccotp/endorsements/atom

but when I run a search, nothing appears.
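One way to narrow this down outside of Nutch is to test the filter lines against the seed URLs directly. The sketch below is only an approximation written in Python, not Nutch's own code: it assumes rules are tried top to bottom, that the first matching rule decides, that a pattern may match anywhere in the URL unless anchored with ^, and that a final "-." rule rejects everything else. The names RULES and accepts are made up for this illustration.

```python
import re

# Hypothetical rule list mirroring the filter lines from the question.
# Each entry is (sign, regex): "+" accepts the URL, "-" rejects it.
RULES = [
    ("+", r"^http://([a-z0-9]*\.)*opentechlearning.com/"),
    ("+", r"^http://([a-z0-9]*\.)*cnx.org/"),
    ("-", r"."),  # assumed catch-all: reject anything not accepted above
]

def accepts(url, rules=RULES):
    """Return True if the first rule matching the URL is a '+' rule."""
    for sign, pattern in rules:
        if re.search(pattern, url):
            return sign == "+"
    return False  # no rule matched at all

if __name__ == "__main__":
    for url in [
        "http://www.opentechlearning.com/",
        "http://cnx.org/lenses/ccotp/endorsements/atom",
        "http://ocw.nd.edu/courselist/rss",
    ]:
        print(url, accepts(url))
```

Under these assumptions, the ([a-z0-9]*\.)* group can match zero times, so a host without www. such as cnx.org is still accepted by the pattern itself. A pattern typed with a capital "Http" or with spaces inside it would not match a lowercase URL, which may be worth checking in the actual file.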

