On Thu, 28 Jun 2001, Pietro Palladino wrote:
> It follows the following steps:
> 1: Digging---> htdig -i -a -s -v
> 2: Purging--->htpurge �-a -v
> 3: Moving (?) --->there are some instructions that do something like moving
> the databases .work in the same ones but with the extension .db....
Not quite. If you're using the -a flag, then you'll want to remove the
.work extension.
> I've 2 questions:
> Where is the merging step?
There isn't one. The merging step was needed by versions prior to the 3.2
betas to construct the search databases from those produced by htdig. At
this point, htdig produces databases that essentially can be searched
directly. (It's generally a good idea to run htpurge over them to remove
any bad URLs, etc.)
> Which program creates the "weakcmpr" database?
It's created by htdig--it's a bookkeeping database for the word database.
> Does this step create the soundex and metaphone databases ?
> What does soundex mean? does it mean that soundex database contains words
> that sounds like other ones (4 example: cut, cat ....)? And what does
> "metaphone" means?
Yes, this would create soundex and metaphone databases--both are mostly
English algorithms for "sounds like" based on approximating the phonetic
spelling of a word. Metaphone is more accurate, but closely tied to
American English. (The soundex algorithm was invented for filing immigrant
records in the US, so it works OK with other European languages.)
> What do they do?
This checks to see if the endings database needs to be rebuilt from the
affix file.
--
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html