Hi Jim,

After rebuilding, my results are now looking correct; however, my exclude list is too large. It appears htsearch is getting the error code "15" back from regcomp, which corresponds to this entry in gregex.h:

    REG_ESIZE, /* Compiled pattern bigger than 2^16 bytes. */
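The numeric code can be matched to its name by counting positions in the error enum. The sketch below mirrors the reg_errcode_t ordering from GNU regex.h, on the assumption that gregex.h keeps the same order (the REG_ESIZE comment quoted above is consistent with that). RegErrCode and describe_regcomp_code are hypothetical names used only for illustration; they are not part of ht://Dig, so check the enum in your own copy of gregex.h before relying on the numbers.

```python
from enum import IntEnum


class RegErrCode(IntEnum):
    """Assumed mirror of reg_errcode_t in GNU regex.h (verify against gregex.h)."""
    REG_NOERROR = 0   # success
    REG_NOMATCH = 1   # regexec found no match
    REG_BADPAT = 2    # invalid pattern
    REG_ECOLLATE = 3  # invalid collating element
    REG_ECTYPE = 4    # invalid character class name
    REG_EESCAPE = 5   # trailing backslash
    REG_ESUBREG = 6   # invalid back reference
    REG_EBRACK = 7    # unmatched left bracket
    REG_EPAREN = 8    # parenthesis imbalance
    REG_EBRACE = 9    # unmatched \{
    REG_BADBR = 10    # invalid contents of \{\}
    REG_ERANGE = 11   # invalid range end
    REG_ESPACE = 12   # ran out of memory
    REG_BADRPT = 13   # nothing to repeat
    REG_EEND = 14     # premature end (GNU addition)
    REG_ESIZE = 15    # compiled pattern bigger than 2^16 bytes (GNU addition)
    REG_ERPAREN = 16  # unmatched ) or \) (GNU addition)


def describe_regcomp_code(code):
    """Return the symbolic name for a numeric regcomp return value."""
    try:
        return RegErrCode(code).name
    except ValueError:
        return f"unknown ({code})"


print(describe_regcomp_code(15))  # the code reported by htsearch
print(2 ** 16)                    # the size limit named in the comment, in bytes
```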
So my exclude pattern is just too big. Is there any way to increase that limit?

Cheers,
Jonathan.

> On Wed, 18 Aug 2004, Jonathan Schlackl wrote:
>
>> I'm just in the middle of upgrading to 3.2.0b6 from 0b5 and have run
>> into a problem.
>>
>> I successfully built a database but all my search results have
>> corrupted URLs. They all contain "mailto:" in the middle of them,
>> like this:
>> http://developmentminitmailto:baseline/baseline2003/baseline-59.html
>>
>> The correct URL should be:
>> http://development.comminit.com/baseline/baseline2003/baseline-59.html
>
> Did you delete all pre-existing database files before running this dig
> with the new version?
>
> Are you sure that you are using the same versions of htdig and htsearch?
> The default setting for common_url_parts did change between b5 and b6, so
> if you are using a program from one version to dig and another to search,
> I think this type of corruption would probably be expected.
>
>> It looks like ".com" is being rewritten somehow but I don't have
>> common_url_parts OR url_part_aliases set anywhere, just using the
>> defaults for those.
>
> Did you customize either of these attributes in the past? If so, did you
> rebuild from scratch after setting them back to the defaults?
>
>> Do I need to set those config options differently for 3.2.0b6?
>
> If the answers to the above questions are no, I would try manually setting
> common_url_parts to something that doesn't include '.com', and then try
> rebuilding the databases from scratch.
>
> Jim
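Jim's point about mismatched common_url_parts defaults can be illustrated with a toy model. The sketch below is not ht://Dig's actual encoding: it simply assumes that common URL parts are stored as short codes based on their position in a list, and that decoding looks the codes up in whatever list the reading program has. The two lists (DIG_PARTS, SEARCH_PARTS) and the helper names are hypothetical, chosen only to show how a changed list could turn ".com" into "mailto:".

```python
# Toy model only: NOT ht://Dig's real on-disk encoding.
# Both part lists below are hypothetical stand-ins for two releases' defaults.
DIG_PARTS = ["http://", ".com", ".html"]        # list in effect when digging
SEARCH_PARTS = ["http://", "mailto:", ".html"]  # different list when searching


def encode_url(url, parts):
    """Replace each common part with a one-character code from its list index."""
    # Substitute longer parts first so a short part cannot split a longer one.
    for index, part in sorted(enumerate(parts), key=lambda item: -len(item[1])):
        url = url.replace(part, chr(index + 1))
    return url


def decode_url(encoded, parts):
    """Expand each one-character code back into the part at that index."""
    for index, part in enumerate(parts):
        encoded = encoded.replace(chr(index + 1), part)
    return encoded


url = "http://development.comminit.com/baseline/baseline2003/baseline-59.html"
stored = encode_url(url, DIG_PARTS)

print(decode_url(stored, DIG_PARTS))     # same list: the original URL comes back
print(decode_url(stored, SEARCH_PARTS))  # different list: ".com" becomes "mailto:"
```

In this model the stored data is fine; only the lookup table differs, which is why rebuilding from scratch with matching versions of both programs clears the problem.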
_______________________________________________
ht://Dig general mailing list: <[EMAIL PROTECTED]>
ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html
List information (subscribe/unsubscribe, etc.)
https://lists.sourceforge.net/lists/listinfo/htdig-general