According to Hauke Meyer: > System: htdig-3.1.6, i386-linux, glibc-1.x. > What I want: eliminate session ID from URL > > I am new at htdig... and got a problem! > I try to eliminate a session ID in an URL like > > "http://xyz.aaa.com/sessionID-fixed-32digits/foo/bar.htm". > > htdig should show this URL like this: > > "http://xyz.aaa.com/foo/bar.htm". > > I first try to set the attribute "url_part_aliases" in the htdig.conf > (used by htdig) and no setting in the htdig.conf used > by htsearch. > > Ths first "dummy" test: > > .... > url_part_aliases: http://xyz.aaa.com http://grumpf.bbb.org > ... > > everything worked!! "http://xyz.aaa.com" was correctly changed in the output! > > the "real life" test: > > ... > url_part_aliases: http://xyz.aaa.com http://xyz.aaa.com/[0-9a-f]\{32\}/ > ...
Where did you get the idea that url_part_aliases supports regular expressions? It never has and never will. If you want to learn how to use url_part_aliases correctly, read http://www.htdig.org/FAQ.html#q4.17 very carefully at least twice, as well as the attrs.html entry for url_part_aliases. However, I can tell you that you're pretty unlikely to accomplish your goal with url_part_aliases, because of the lack of regular expression support. You're better off using url_rewrite_rules or search_rewrite_rules for this. Choose one or the other, depending on whether you want the session IDs stripped off during the indexing phase (before the documents are fetched by htdig), or just after the fact in search results only. See http://www.htdig.org/attrs.html#url_rewrite_rules and http://www.htdig.org/attrs.html#search_rewrite_rules -- Gilles R. Detillieux E-mail: <[EMAIL PROTECTED]> Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/~grdetil Dept. Physiology, U. of Manitoba Phone: (204)789-3766 Winnipeg, MB R3E 3J7 (Canada) Fax: (204)789-3930 _______________________________________________ htdig-general mailing list <[EMAIL PROTECTED]> To unsubscribe, send a message to <[EMAIL PROTECTED]> with a subject of unsubscribe FAQ: http://htdig.sourceforge.net/FAQ.html

