"Briggs, Gary" wrote:
> 
> OK, so I've been reading the source code, and I'm having real trouble with
> what some parts of the url table are for...
> 
> Why bother to compute a crc32 for the urls? do I need it? [I'm currently
> using crc-multi db mode on mySQL]

It is url.crc32 of document content. It helps to detect and eliminate
clones, i.e. absolutely the same documents but under different 
locations.



> Why is there a keywords field? I thought that the search worked by:
> 
> 0)  Compute the crc's of the keywords we're looking for
> 1)  looking up the crc's we're searching for from the dict tables
> 2)  using url_id as a foreign key, look up the relevant url for rec_id key
> in the url table
> 3)  also look up all the other information from the url table, such as
> description, title, text
> 
> Surely this doesn't need a keyword field, since we're searching other tables
> based on keywords anyway?


  url.keywords field may be displayed in search template. There is no
another
  usage of this field.



> What's the difference between txt and description? I assume that Description
> is the descrption if there's a description meta-tag, and txt is an extract
> of the text.

  That's right.

> Why is there both? Surely a unified field that could contain description, if
> it's there, or an extract, if there's not?


   You may use both fields in search template. There is also $DX
variable
which is description when it exists and extract of the text in other
case.


   All fields: keywords, descr, title, txt are used only for search
results
displaying.

___________________________________________
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]

Reply via email to