Re: [PATCHES] CopyReadLineText optimization

Andrew Dunstan Thu, 06 Mar 2008 12:45:36 -0800


Greg Smith wrote:

On Thu, 6 Mar 2008, Heikki Linnakangas wrote:
At the most conservative end, we could fall back to the currentmethod on the first escape, quote or backslash character.
I would just count the number of escaped/quote characters on eachline, and then at the end of the line switch modes between the currentcode on the new version based on what the previous line looked like.That way the only additional overhead is a small bit only when escapesshow up often, plus a touch more just once per line. Barely noticablein the case where nothing is escaped, very small regression forescape-heavy stuff but certainly better than the drop you reported inthe last rev of this patch.
Rev two of that design would keep a weighted moving average of thetotal number of escaped characters per line (saywma=(7*wma+current)/8) and switch modes based on that instead of theprevious one. There's enough play in the transition between where thetwo approaches work better at that this should be easy enough to get adecent transition between. Based on your data I would put thetransition at wma>4, which should keep the old code in play even ifonly half the lines have the bad regression that shows up with >8escapes per line.

I'd be inclined just to look at the first buffer of data we read in, andmake a one-off decision there, if we can get away with it. Then the costof testing is fixed rather than per line.


cheers

andrew

--
Sent via pgsql-patches mailing list (pgsql-patches@postgresql.org)
To make changes to your subscription:
http://mail.postgresql.org/mj/mj_wwwusr?domain=postgresql.org&extra=pgsql-patches

Re: [PATCHES] CopyReadLineText optimization

Reply via email to