According to Abbie Greene:
> I recently ran version 3.2.0b4 and am receiving great search results,
> except that it didn't index what seems like 3/4 of my files. I have an
> entire directory not showing up in searches, and the database size is
> extremely small compared to the number of files I have sitting on my
> machine. All in all I have over 26000 files that are sitting in an
> 'index.html' file with links to all of them. It was originally indexing
> both directories when there were only 4000 files total.

See http://www.htdig.org/FAQ.html#q5.1 and
http://www.htdig.org/FAQ.html#q5.25

If your index.html file has links to 26000 files, it must be huge. Are
you sure max_doc_size is generous enough? If that's not the problem,
5.25 will offer lots of other suggestions.

> Is there anyway
> to find out exactly how many files got indexed? Or what directories?

It's hard to get those numbers after the fact, especially in 3.2.0b4.
In 3.1.6, htmerge -s will give some stats on numbers of documents, but
this option isn't (yet) supported in htpurge in 3.2. Your best bet is
to run htdig with at least one -v option, to get the running list of
URLs as it indexes them. If htdig is missing or rejecting links that
you think it should be seeing, you'd probably need at least -vvv to get
enough info about the reasons. Note that this will generate lots of
output, which you should redirect to a file.

-- 
Gilles R. Detillieux              E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/
Dept. Physiology, U. of Manitoba  Winnipeg, MB  R3E 3J7  (Canada)

_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html
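
A rough way to test the max_doc_size suggestion above is to compare the size of the index.html file with the limit and count how many links fall inside it. The following is only a sketch, not part of ht://Dig: the names LinkCounter and check_index are made up for illustration, the 100000-byte default is an assumption you should check against your own configuration, and it assumes htdig simply stops reading a document at max_doc_size bytes.

```python
import sys
from html.parser import HTMLParser


class LinkCounter(HTMLParser):
    """Counts <a href="..."> start tags in an HTML document."""

    def __init__(self):
        super().__init__()
        self.links = 0

    def handle_starttag(self, tag, attrs):
        if tag == "a" and any(name == "href" for name, _ in attrs):
            self.links += 1


def check_index(html_bytes, max_doc_size=100000):
    """Return (size_in_bytes, links_in_whole_file, links_within_limit).

    max_doc_size is an assumed limit in bytes; use the value from your
    own htdig configuration file.
    """
    total = LinkCounter()
    total.feed(html_bytes.decode("latin-1"))

    # Only parse what would survive truncation at the size limit.
    # A tag cut off at the boundary is left unparsed and not counted.
    kept = LinkCounter()
    kept.feed(html_bytes[:max_doc_size].decode("latin-1"))

    return len(html_bytes), total.links, kept.links


if __name__ == "__main__" and len(sys.argv) > 1:
    with open(sys.argv[1], "rb") as fh:
        size, total_links, kept_links = check_index(fh.read())
    print(f"size: {size} bytes")
    print(f"links in file: {total_links}")
    print(f"links within assumed limit: {kept_links}")
```

If the third number is much smaller than the second, the size limit is a likely reason for the missing documents, and raising max_doc_size in the configuration file before re-running htdig would be the first thing to try.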

