Re: [PATCHES] CopyReadLineText optimization

Greg Smith Thu, 06 Mar 2008 12:31:09 -0800

On Thu, 6 Mar 2008, Heikki Linnakangas wrote:

At the most conservative end, we could fall back to the current methodon the first escape, quote or backslash character.

I would just count the number of escaped/quote characters on each line,and then at the end of the line switch modes between the current code onthe new version based on what the previous line looked like. That way theonly additional overhead is a small bit only when escapes show up often,plus a touch more just once per line. Barely noticable in the case wherenothing is escaped, very small regression for escape-heavy stuff butcertainly better than the drop you reported in the last rev of this patch.

Rev two of that design would keep a weighted moving average of the totalnumber of escaped characters per line (say wma=(7*wma+current)/8) andswitch modes based on that instead of the previous one. There's enoughplay in the transition between where the two approaches work better atthat this should be easy enough to get a decent transition between.Based on your data I would put the transition at wma>4, which should keepthe old code in play even if only half the lines have the bad regressionthat shows up with >8 escapes per line.


--
* Greg Smith [EMAIL PROTECTED] http://www.gregsmith.com Baltimore, MD

--
Sent via pgsql-patches mailing list ([email protected])
To make changes to your subscription:
http://mail.postgresql.org/mj/mj_wwwusr?domain=postgresql.org&extra=pgsql-patches

Re: [PATCHES] CopyReadLineText optimization

Reply via email to