According to Hauke Meyer:
> System:  htdig-3.1.6, i386-linux, glibc-1.x.
> What I want: eliminate session ID from URL 
> 
> I am new at htdig... and got a problem!
> I try to eliminate a session ID in an URL like
> 
> "http://xyz.aaa.com/sessionID-fixed-32digits/foo/bar.htm".
> 
> htdig should show this URL like this:
> 
> "http://xyz.aaa.com/foo/bar.htm".
> 
> I first try to set the attribute "url_part_aliases" in the htdig.conf
> (used by htdig) and no setting in the htdig.conf used
> by htsearch. 
> 
> The first "dummy" test:
> 
> ....
> url_part_aliases:  http://xyz.aaa.com http://grumpf.bbb.org 
> ...
> 
> everything worked!! "http://xyz.aaa.com" was correctly changed in the output!
> 
> the "real life" test:
> 
> ...
> url_part_aliases:  http://xyz.aaa.com http://xyz.aaa.com/[0-9a-f]\{32\}/
> ...

Where did you get the idea that url_part_aliases supports regular
expressions?  It never has and never will.  If you want to learn how to
use url_part_aliases correctly, read http://www.htdig.org/FAQ.html#q4.17
very carefully at least twice, as well as the attrs.html entry for
url_part_aliases.

However, you're unlikely to accomplish your goal with url_part_aliases,
because it lacks regular expression support.  You're better off using
url_rewrite_rules or search_rewrite_rules, both of which take regular
expressions.  Use url_rewrite_rules (in the config used by htdig) if you
want the session IDs stripped off during the indexing phase, before the
documents are fetched, or search_rewrite_rules (in the config used by
htsearch) if you only want them stripped after the fact in search results.

See http://www.htdig.org/attrs.html#url_rewrite_rules
and http://www.htdig.org/attrs.html#search_rewrite_rules
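
Before committing a pattern to either attribute, it can help to check
the regular expression on its own.  The following is only a rough
sketch in Python, not htdig configuration: it assumes the session ID is
exactly 32 lowercase hex digits in the first path component, as in your
example, and the function name strip_session_id is made up for
illustration.  htdig's own regex dialect and config-file escaping may
differ, so check the final rule against attrs.html.

```python
import re

# Assumption: the session ID is exactly 32 lowercase hex digits and is the
# first path component after the host, as in the example URL in the question.
SESSION_ID = re.compile(r"^(http://xyz\.aaa\.com)/[0-9a-f]{32}/")


def strip_session_id(url: str) -> str:
    """Return url with the session-ID path component removed, if present."""
    # \1 is the captured scheme and host; the session ID itself is dropped.
    return SESSION_ID.sub(r"\1/", url)


if __name__ == "__main__":
    sid = "0123456789abcdef" * 2  # a made-up 32-digit session ID
    print(strip_session_id("http://xyz.aaa.com/" + sid + "/foo/bar.htm"))
```

URLs without a session ID pass through unchanged, so the same rule is
safe to apply to every URL on the site.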

-- 
Gilles R. Detillieux              E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930
