I'm sorry, Rusty.  I just noticed what the problem was with my advice
below, and why the changes to the config file didn't work.  Wherever I
said "url_part_aliases", what I really meant was "common_url_parts".
Sorry about the confusion.  The compiled-in value of url_part_aliases is
empty, as it should be, but it's the compiled-in value of common_url_parts
that changed with the ssl.3 patch.  I think when my brain said
common_url_parts, my fingers typed url_part_aliases just out of habit.
The common_url_parts attribute almost never comes up in discussions on
this list, but questions about url_part_aliases come up almost every week.
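
With the attribute name corrected, the config-file workaround quoted
further down would read as shown here.  This is only the earlier line
with the name swapped; the value is the one given in the quoted message
and has not been re-checked against htcommon/defaults.cc.

```
common_url_parts: https:// https://www. http:// http://www. ftp:// ftp://ftp. /pub/ .html .htm .gif .jpg .jpeg /index.html /index.htm .com/ .com mailto:
```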

In any case, using the patched htsearch is probably your best bet.
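
For anyone wondering why a mismatched list turns .html into .gif, here
is a rough Python sketch.  It is a simplified model, not ht://Dig's
actual codec: it assumes each listed part is stored as a code based on
its position in the list, and it assumes the unpatched list is the
patched one minus the two https entries at the front.  The function
names are made up for the illustration.

```python
# Simplified model only; ht://Dig's real encoding differs in detail.
PATCHED = ("https:// https://www. http:// http://www. ftp:// ftp://ftp. "
           "/pub/ .html .htm .gif .jpg .jpeg /index.html /index.htm "
           ".com/ .com mailto:").split()

# Assumption: the unpatched list lacks the two https items at the front.
UNPATCHED = PATCHED[2:]


def encode(url, parts):
    """Replace each listed part with a one-character positional code."""
    # Try longer parts first so ".html" wins over ".htm".
    for part in sorted(parts, key=len, reverse=True):
        url = url.replace(part, chr(1 + parts.index(part)))
    return url


def decode(encoded, parts):
    """Expand positional codes back into parts; other characters pass through."""
    out = []
    for ch in encoded:
        idx = ord(ch) - 1
        out.append(parts[idx] if 0 <= idx < len(parts) else ch)
    return "".join(out)


stored = encode("/docs/page.html", PATCHED)   # what the patched htdig would store
print(decode(stored, PATCHED))                # decoded with the matching list
print(decode(stored, UNPATCHED))              # decoded with the older list
```

In this model, .html is item 8 of the patched list and .gif is item 8 of
the unpatched one, so the second decode comes back with a .gif suffix.
Suffixes that are not in either list, such as .pdf or .cgi, are never
encoded and so come through unchanged either way.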

According to Rusty Nejdl:
> Gilles,
> 
> Thanks!  That was what was needed.  I think that when I upgraded htdig and used
> the ssl.3 patch, I kept my old version of htsearch.  I still need to clean up
> a few things with the image links being broken now, but thanks again for your
> help!
> 
> Rusty
> 
> Gilles Detillieux(grdetil) wrote:
> > According to Rusty Nejdl:
> > > I applied the simpler solution first, which was to add the url_part_aliases
> > > line as follows:
> > > 
> > > #URL Alias for Patching .gif bug
> > > url_part_aliases: https:// https://www. http:// http://www. ftp:// ftp://ftp.
> > > /pub/ .html .htm .gif .jpg .jpeg /index.html /index.htm .com/ .com mailto:
> > > 
> > > However, after rebuilding the search database, no change.  
> > 
> > You shouldn't have needed to rebuild the database, as the internal
> > setting of url_part_aliases for the patched htdig should already have
> > been this value.  This is, of course, assuming you used the "ssl.3" patch
> > from the patch archive, or some variant of it which had the same value.
> > You can confirm this by searching (with grep) for url_part_aliases in
> > the htcommon/defaults.cc source file.
> > 
> > The important thing is to have this setting in the config file that
> > htsearch uses, so it will decode correctly.  Are you sure you're using
> > the same config file for htsearch as for htdig?
> > 
> > > So, this is where I get to the depth of my knowledge.  I thought that I 
> > > applied the patch correctly, but I guess I didn't.  Where is the htsearch
> > > patch that I need to apply?  How do I apply it?  And, how do I know which patch
> > > to use?
> > 
> > There isn't a separate patch to apply for htsearch.  The ssl.3 patch changes
> > htcommon/defaults.cc, which sets the default attribute values for all the
> > ht://Dig programs, including htdig, htmerge and htsearch.
> > 
> > The problem, as far as I can tell from the information you've given, is
> > that the new htsearch binary, which would have been built after applying the
> > patch, is not installed in your cgi-bin (or wherever you put CGI programs
> > on your system).  Based on the behaviour you described, it sounds like
> > you're still running an older htsearch binary, which was installed from
> > an earlier build of ht://Dig before you applied the patch.  If you find
> > that binary and replace it with the newer one, built from the patched
> > source, it should clear up the problem, without needing to change the
> > config file.  The change to the config file is only needed if for some
> > reason you can't replace the older htsearch binary with the newer one,
> > e.g. if some parts of the web site are indexed with the unpatched htdig,
> > and therefore would need the unpatched htsearch.  If everything is indexed
> > with the same, patched htdig, then you should install the newer htsearch
> > so it will have the newer, patched setting of url_part_aliases built-in.
> > 
> > > Gilles Detillieux(grdetil) wrote:
> > > > According to Rusty Nejdl:
> > > > > Let me add that I don't have the url_part_aliases configed.  The only real 
> > > > > tweaking I did of the conf file is the following:
> > > > ...
> > > > > Also, I have found that ONLY .html files get renamed as .gif files.  Files that
> > > > > have .cgi in them work fine and files that end in .pdf work fine.  That is, the
> > > > > links print correctly.
> > > > 
> > > > I just had a look at the ssl.3 patch on the ftp.ccsf.org archive,
> > > > and it does indeed add a couple new items to the front of the list for
> > > > the compiled-in value of url_part_aliases.  So, the behaviour you're
> > > > describing is consistent with building a database with a patched htdig
> > > > and htmerge, but searching with an unpatched htsearch.  In the patched
> > > > code, .html is the 8th item in the list, while in the unpatched code
> > > > .gif is the 8th item in the list.  Either install the patched htsearch
> > > > on your system, or add the following to your search config file:
> > > > 
> > > > url_part_aliases: https:// https://www. http:// http://www. ftp:// ftp://ftp. /pub/ .html .htm .gif .jpg .jpeg /index.html /index.htm .com/ .com mailto:
> > > > 
> > > > > "If it ain't broke, it doesn't have enough features yet."
> > > >  
> > > > Well, I guess it's got enough features!  :)


-- 
Gilles R. Detillieux              E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930

_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
Information: http://lists.sourceforge.net/lists/listinfo/htdig-general
FAQ: http://htdig.sourceforge.net/FAQ.html
