OK, you said that none of your documents exceeds max_doc_size, but what
about the size of the directory listing that Apache will generate for
this directory?  With 1477 document in a single directory, a typical
Apache directory listing for that would be over 160 KB.  Depending on
the length of the file names, it could be much more than that.  If you
have a max_doc_size of 100000 (the compiled-in default), that would
easily account for truncation of this directory.

According to pyp:
> Yes, all the documents are in the same directory, and there is no links in
> these documents.
> The .html files contain only text.
> I'll check the apache's logs to see if it's the problem.
> 
> -----Message d'origine-----
> De : Gilles Detillieux [mailto:[EMAIL PROTECTED]]
> Envoye : jeudi 6 juin 2002 20:11
> A : pyp
> Cc : [EMAIL PROTECTED]
> Objet : Re: [htdig] Htdig strange behaviour
> 
> 
> According to pyp:
> > I have 1477 documents in my base.
> > When i run htdig, only 736 documents are indexed.
> > There's no errors, when i print the statistics, the last message is that
> one
> ...
> > Read a total of 2313 bytes
> >  size = 2313
> > pick: www.myserveur.com, # servers = 1
> >
> > It stops here, there is no error.
> > None of my documents exceeds the maximum doc size.
> > I don't know if it's htdig or maybe apache that causes the trouble.
> > Can you help me?
> 
> Are you certain that all of those 1477 documents can be
> found by following HTML links from your start_url on to other
> documents?  If htdig can't see links to some of them, it won't
> know about them.  See http://www.htdig.org/FAQ.html#q5.25 and
> http://www.htdig.org/FAQ.html#q5.18


-- 
Gilles R. Detillieux              E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/
Dept. Physiology, U. of Manitoba  Winnipeg, MB  R3E 3J7  (Canada)

_______________________________________________________________

Don't miss the 2002 Sprint PCS Application Developer's Conference
August 25-28 in Las Vegas -- http://devcon.sprintpcs.com/adp/index.cfm

_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to