Re: [Dbmail-dev] header storage schema changes. (W)Here we (may) go.

Paul J Stevens Sat, 1 Jan 2005 10:32:10 +0100 (CET)

Matthew T. O'Connor wrote:

Personally, I think we should use a very clean and generic design forheaders / header searching.
First: Leave the current table structures intact, I don't think werequire any modifications to them.
Second:  Add 2 new tables: header_list and header_values

Breaking out the header names to a table of their own may be useful. But doing it just for storage's sakeseems a bit overkill given the added complexity in constructing queries and maintaining data integrity. Ifit's boosts performance for the target use-cases (search,sort,thread) I all for it, if.

header_list: ( Contains an exhaustive list of all headers from allmessages in the database. )
   header_id   int     primary key
   header        text   not null
header_values: ( Contains the values from all the headers in all themessages in database )
   header_value_id   serial primary key
message_id int (references unique message ID from thephysmessage table)
   header_id      int    (references unique ID from the header_list table)
   header_value   text  (the actual value from this header in this message)
hearder_order int ( optional column, used to be able to recreatethe header order from the original message)

Header_order will never happen. Recreating headers from the header tables will never happen. Complete headersare stored in the messageblk. Also, header-order from the original message is already *not* being maintained.Gmime does it's own reformatting and reshuffling of the headers.

This structure will make it very easy to query all the headers from agiven message or find all the messages with a given header, or a givenheader value. It also leaves our current structure intact which willmake it easier to phase in.

Agreed. Starting with a single separate headers table, or with two tables like you propose will probably bethe starting point. Once we have consistent storage of headers, it will be relatively easy to move certainheaders to tables of their own, or merge them into the physmessage table. Of course, postgres users couldprobably even use triggers for stuff like that.

What do you think? I don't think we need to special case any headersnot even sendername or subject.

Well, yukatan's datamodel looks like a very serious attempt at optimizing datastorage for email. My workingassumption is that there are some very valid reasons for doing it the way they're doing things. Also, as along term goal, a unified model for sql based email storage is something I think about.




--
  ________________________________________________________________
  Paul Stevens                                  mailto:[EMAIL PROTECTED]
  NET FACILITIES GROUP                     PGP: finger [EMAIL PROTECTED]
  The Netherlands________________________________http://www.nfg.nl

Re: [Dbmail-dev] header storage schema changes. (W)Here we (may) go.

Reply via email to