According to Abbie Greene:
> I recently ran version 3.2.0b4 and am receiving great search results,
> except that it didn't index what seems like 3/4 of my files.  I have an
> entire directory not showing up in searches, and the database size is
> extremely small compared to the number of files I have sitting on my
> machine.   All in all I have over 26000 files that are sitting in an
> 'index.html' file with links to all of them.  It was originally indexing
> both directories when there were only 4000 files total.

See http://www.htdig.org/FAQ.html#q5.1
and http://www.htdig.org/FAQ.html#q5.25

If your index.html file has links to 26000 files, it must be huge.  Are
you sure max_doc_size is generous enough?  If that's not the problem,
5.25 will offer lots of other suggestions.

> Is there anyway
> to find out exactly how many files got indexed?   Or what directories?

It's hard to get those numbers after the fact, especially in 3.2.0b4.
In 3.1.6, htmerge -s will give some stats on numbers of documents, but
this option isn't (yet) supported in htpurge in 3.2.  Your best bet is
to run htdig with at least one -v option, to get the running list of
URLs as it indexes them.  If htdig is missing or rejecting links that
you think it should be seeing, you'd probably need at least -vvv to
get enough info about the reasons.  Note that this will generate lots
of output, which you should redirect to a file.

-- 
Gilles R. Detillieux              E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/
Dept. Physiology, U. of Manitoba  Winnipeg, MB  R3E 3J7  (Canada)


-------------------------------------------------------
This SF.net email is sponsored by: SlickEdit Inc. Develop an edge.
The most comprehensive and flexible code editor you can use.
Code faster. C/C++, C#, Java, HTML, XML, many more. FREE 30-Day Trial.
www.slickedit.com/sourceforge
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to