Hi, we're using Nutch 0.8. In deafault.xml "ignore external links" is set "true". Can anybody tell me where we can find the code to this property? We've got the problem, that now, there are many "intern" pages, that aren't indexed. Doesn't seem to make sense, because they are on the same server, like other indexed pages. When we set "ignore external links" "false" they are indexed. What could be the problem?
Peter ------------------------------------------------------------------------- Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier. Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642 _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
