On 8/13/2011 10:57 AM, Paul J Stevens wrote:
> On 08/13/2011 01:10 PM, Robin Horforth wrote:
> 
>> The import takes Dovecot (MailDir or mdbox format), or Panda IMAP (mix)
>> about six minutes to complete.
> 
> I'll take your word for it.

As a follow-up. I added imap->search() after EACH append() on the same vmWare 
setup, which gives me the following import times on Dovecot 2.0.13 (SiS posix 
attachment storage, mdbox format):

real    45m42.943s
user    2m39.620s
sys     0m21.671s

222616  /mail/home/usertest
363724  /mail/attachments/
586340  total

This reflects a full-text search indexing configuration of "fts_squat = 
partial=4 full=10", which provides for IMAP4r1 complianet substring searching 
in both headers + body areas.

The gap's closed considerably of course, and I feel this is closer to an equal 
playing field in the way of comparing append/insertion performance.  But the 
difference in performance is still very large. Sure, I expect to "pay" 
something for using SQL and a relatively higher latency backing store - but I 
didn't realise it'd be this much.

As a sanity check to ensure the searches are being properly accelerated by 
Dovecot's FTS indexing, I performed a quick set of IMAP SEARCH TEXT commands 
against some of the larger mailboxes which look at "not found", "all found", 
and substring-stressor "moderate" result sets for comparison purposes:
                                                                     
Searching INBOX2010 #msgs = 4729 (Cold Indexing Time=14.784021s)

 [NOFIND] Time=0.02358, matches=0
 [date] Time=0.009816, matches=4729
 [here] Time=0.006297, matches=2644

Searching INBOX2006 #msgs = 4149 (Cold Indexing Time=7.055049s)
 [NOFIND] Time=0.020683, matches=0
 [date] Time=0.008533, matches=4149
 [here] Time=0.004641, matches=1734

With all FTS functionality turned off:

Searching INBOX2010 #msgs = 4729
 [NOFIND] Time=6.815283, matches=0
 [date] Time=0.494466, matches=4729
 [here] Time=5.035462, matches=2643

Searching INBOX2006 #msgs = 4149
 [NOFIND] Time=2.721383, matches=0
 [date] Time=0.374643, matches=4149
 [here] Time=2.156622, matches=1734

IMAP-compliant SEARCHes are indeed benefiting from the fts_squat option.

=R=
_______________________________________________
DBmail mailing list
[email protected]
http://mailman.fastxs.nl/cgi-bin/mailman/listinfo/dbmail

Reply via email to