Geoff,

Let me add that I don't have the url_part_aliases configed.  The only real 
tweaking I did of the conf file is the following:

start_url:              https://www.training.verio.net/ https://www.training.ver
io.net/tech/ https://www.training.verio.net/provisioning/ https://www.training.v
erio.net/mgmt/ https://www.training.verio.net/billing/collections/ https://www.t
raining.verio.net/billing/cbc/ https://www.training.verio.net/tech/docs/ https:/
/www.training.verio.net/billing/cbc/docs/ https://www.training.verio.net/billing
/collections/docs/ https://www.training.verio.net/sales/docs/ https://www.traini
ng.verio.net/provisioning/docs/   

max_head_length:        100000 

max_doc_size:           95000000  

external_parsers: application/msword /usr/local/bin/parse_doc.pl \
                  application/postscript /usr/local/bin/parse_doc.pl \
                  application/pdf /usr/local/bin/parse_doc.pl    

Also, I have found that ONLY .html files get renamed as .gif files.  Files that
have .cgi in them work fine and files that end in .pdf work fine.  That is, the
links print correctly.

Rusty

Geoff Hutchison(ghutchis) wrote:
> On Thu, 8 Feb 2001, Rusty Nejdl wrote:
> 
> > /opt/www/htdig/bin/htdig -u htdig:htd1ggles -i -s
> > /opt/www/htdig/bin/htmerge -c /opt/www/htdig/conf/htdig.conf -vvv -s
> 
> Yes, these will clear the databases every time. I don't think it has
> anything to do with common_url_parts because you say the URLs look right
> in the database itself. This leaves two possibilities:
> 
> 1) There's a URL mapping activated (e.g. with url_part_aliases or the
> url_rewrite patch) that's not quite right.
> 2) There's something amiss with your patched version of the source--you
> said you used someone's SSL patch, right?
> 
> But if the URLs are right in the database, then there's something going on
> in your htsearch code.
> 
> --
> -Geoff Hutchison
> Williams Students Online
> http://wso.williams.edu/

-- 
Rusty Nejdl <[EMAIL PROTECTED]>
"If it ain't broke, it doesn't have enough features yet."

_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
Information: http://lists.sourceforge.net/lists/listinfo/htdig-general
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to