Aha! This is to report that I have fixed the bug in the document databases. This bug surfaced significantly with META description tags which seemed to "infect" documents which did not have them. (I actually haven't tested the fix thoroughly, but on rebuilding the databases, I didn't see the bug in any of my searches so...) I'm posting a patch for this bug, though I expect to release version 3.1.0b2 very soon. This version will contain this fix, several other bug-fixes (including the patch for Connection.cc on Dec Alphas) and a few new features. One note--since this bug corrupts the databases, I suggest rebuilding them from scratch after applying the patch. While I know this is an inconvenience, you can't repair your data otherwise. :-( -Geoff Hutchison Williams Students Online http://wso.williams.edu/ P.S. I know I promised merging multiple databases in this release. I do not have this (or a number of other promised patches). I feel the bugs that are fixed are significant enough to force an "early" release. *** htdig-3.1.0b1/htdig/Retriever.cc Mon Sep 7 23:29:55 1998 --- htdig3.dev/htdig/Retriever.cc Sat Oct 17 18:16:11 1998 *************** Retriever::Retriever() *** 79,85 **** factor[7] = config.Double("heading_factor_6"); factor[8] = factor[9] = 0; factor[10] = config.Double("keywords_factor"); ! factor[11] = config.Double("meta_desription_factor"); // // Open the file to which we will append words. --- 87,93 ---- factor[7] = config.Double("heading_factor_6"); factor[8] = factor[9] = 0; factor[10] = config.Double("keywords_factor"); ! factor[11] = config.Double("meta_description_factor"); // // Open the file to which we will append words. *************** Retriever::parse_url(URLRef &urlRef) *** 280,286 **** ':' << url.get() << ": "; cout.flush(); } ! doc->Url(url.get()); doc->Referer(urlRef.Referer()); --- 290,298 ---- ':' << url.get() << ": "; cout.flush(); } ! ! delete doc; ! doc = new Document; doc->Url(url.get()); doc->Referer(urlRef.Referer()); *************** Retriever::RetrievedDocument(Document &d *** 436,441 **** --- 457,464 ---- current_ref = ref; current_anchor_number = 0; current_title = 0; + current_head = 0; + current_meta_dsc = 0; // // Create a parser object and let it have a go at the document. ---------------------------------------------------------------------- To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] containing the single word "unsubscribe" in the body of the message.
