As part of the crawl, we're indexing an affiliate site and need to
massage their urls for display in the search results.  So I look through
the docs and find "search_rewrite_rules" and it sounds like it'll do the
trick. 

Now I'm no wiz at regular expressions, but here's what I want to do: 
original url: 
http://www.domain.ca/PrinterFriendly2.cfm?ArticleId=ZZZ 
the url I want: 
http://www.domain.ca/index.cfm?Param=YYY&ArticleId=ZZZ 

So I put this in the htdig.conf (all on one line): 

search_rewrite_rules:  
http://www\\.domain\\.ca/PrinterFriendly2\\.cfm\\?ArticleId=(.*)    
http://www\\.domain\\.ca/index\\.cfm\\?PgNm=TCE&ArticleId=\\1 

And, of course, it doesn't work.

So, more searching in the faq and maillist and I come across the entry
for url_part_aliases but it implies that I should use either
url_rewrite_rules or search_rewrite_rules.

I guess what I'm asking is which method is best for rewriting, and
what's wrong with my regex?

Thanks,
Greg



-- 
Greg Burnham
Lead Technologist
7th Floor Media
SFU @ HC
515 West Hastings,
Vancouver, BC
V6B 5K3
604 291 5277 (ph)
604 291 5173 (fx)


_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to