Author: Alex Barkov Email: [EMAIL PROTECTED] Message: To Dmitry, this is definitly thing for FAQ.
If indexer doesn't store in database some links that are expected to be stored, two reasons are possible: 1. Indexer doesn't find links on the page. This is probably a bug in HTML parser in this case. 2. Indexer does find links, but rejects them. This means that indexer.conf configuration doesn't allow these links. (Or again probably a bug in indexer.conf related code). Let's say a page http://localhost/links.html contains such ignored links. To discover the reason, start indexer with these command line arguments: indexer -am -u http://localhost/links.html It will display various debug information, including: - every found link on this page - information why indexer rejects found link or accepts it Seeing this information, it is easily to detect what is the reason of unexpected indexer behaviour: either 1 or 2. When you know a reason, either reconfigure indexer.conf and check again, or feel free to report it as a bug when you're sure that everything is OK in your indexer.conf. When reporting such bugs, please always include indexer -am -u http://localhost/links.html output, as well as Server/Ream Allow/Disallow commands from your indexer.conf Hope this helps. > I have finally got the cgi scripts working. And there was much rejoicing so that I >can use the search.cgi. I've modifed the indexer.conf to the way in which I think it >works, but when I run it, I can clearly see that pages the site references are not >being indexed. It does read some other levels but not all. > > Changes I have made in the indexer.conf are : > > Realm http://10.8.2.18/* > URL http://10.8.2.18/level4/uk/dell/dell.htm (to try and force it to read a page >which is not being referrenced) > Server http://10.8.2.18/ > > And that's about it really. > > Any thoughts would be appreciated. > Paul > Reply: <http://www.mnogosearch.org/board/message.php?id=3524> ___________________________________________ If you want to unsubscribe send "unsubscribe general" to [EMAIL PROTECTED]
