I think it would help if the examples in the documentation had explanations as to what the examples are trying to accomplish.
From the url_part_aliases doc it says "A list of translations pairs from and to, used when accessing the database. If a part of an URL matches with the from-string of each pair, it will be translated into the to-string just before writing the URL to the database, and translated back just after reading it from the database." url_part_aliases: http://mgate/ http://balius.com/ I indexed using no url_part_aliases and put the above in the htdig.conf file for the search to read. I got back URLs that begin with http://mgate/ What am I not understanding here? I can architect a mail system to support 1M users easily and yet this stupid thing has me stumped. :) -Chad- At 11:12 AM 2/5/2002 -0600, Gilles Detillieux wrote: >According to Chad M. Stewart: > > Gilles, > > > > I hope you don't mind me contacting you directly, but from my searching > of the > > archives indicates that you know htdig well. If you ever have a question > > about iPlanet Messaging Server I can return the favor. :) > >I don't mind, but I do generally put htdig-related mail that's off-list >at the back of the queue. If you'd prefer a faster response, the mailing >list is a better option. > > > I found your note here, > > http://www.geocrawler.com/archives/3/8825/2001/3/0/5440702/ and read it > and I > > agree with you that newbies have a hard time understanding > > url_part_aliases. I understand that it is a compression algorithm. What I > > don't understand is the syntax of such. The way I understand it during > index > > A-->B and then during search B-->A. If that is right then I don't think it > > will help my situation. I'd be happy to take a crack at writing a newbie > > understandable example of it, if I only understood the syntax. > >I wrote that note back in March of last year. Since then, there has >been a lot more discussion, and I've made changes to the documentation >for url_part_aliases in the new 3.1.6 release, as well as substantial >additions to http://www.htdig.org/FAQ.html#q4.17 to explain the common >pitfalls when people ignore the documentation's recommendations. >Please have another look and see if that doesn't help you understand >how to use this attribute. > > > Here is what I want to do and I can not figure out how to do it. > > > > I indexed files using 'http://mgate/ims/' as the start_url. When I do a > > search I get 'http://mgate/ims' back as the start of the url, > expected. The > > problem is that the host, mgate, is not accessible from everywhere. If > I do > > the search from my LAN the resulting http://mgate/... is valid. If you > were > > to do the search http://mgate/... would be invalid. If I made the URL > valid > > for you, then it would be invalid for my LAN. > > > > From the LAN the hostnames mgate and mgate.amotken.com are valid and > > reachable. From the Internet those hostnames are invalid. People on the > > Internet can reach the exact same content by using www.balius.com. I > can not > > use www.balius.com while on my LAN that would be invalid. > > > > Thus my quandry of how to have the host part of the URL changed. I > don't mind > > having two different search pages, one for LAN one for WAN. If that is the > > way to solve my quandry then how to I get the search from the Internet > side to > > be changed to say www.balius.com when the data was indexed using mgate? > >That's a pretty simple application of url_part_aliases. This would >only need one pair of strings in the definition, with the left-hand-side >changing in the search config file used on the Internet side. > >-- >Gilles R. Detillieux E-mail: <[EMAIL PROTECTED]> >Spinal Cord Research >Centre WWW: http://www.scrc.umanitoba.ca/~grdetil >Dept. Physiology, U. of Manitoba Phone: (204)789-3766 >Winnipeg, MB R3E 3J7 (Canada) Fax: (204)789-3930 _______________________________________________ htdig-general mailing list <[EMAIL PROTECTED]> To unsubscribe, send a message to <[EMAIL PROTECTED]> with a subject of unsubscribe FAQ: http://htdig.sourceforge.net/FAQ.html

