Re: [HACKERS] COPY enhancements

Greg Smith Fri, 09 Oct 2009 08:43:18 -0700

On Fri, 9 Oct 2009, Tom Lane wrote:

what do we do with rows that fail encoding conversion? For logging to a file we could/should just decree that we write out the original, allegedly-in-the-client-encoding data. I'm not sure what we do about logging to a table though. The idea of storing bytea is pretty unpleasant but there might be little choice.

I think this detail can get punted as documented and the error logged, but not actually handled perfectly. In most use cases I've seen here, saving the rows to the "reject" file/table is a convenience rather than a hard requirement anyway. You can always dig them back out of the original again if you see an encoding error in the logs, and it's rare you can completely automate that anyway.

The main purpose of the reject file/table is to accumulate things you might fix by hand or systematic update (e.g. add ",\N" for a missing column when warranted) before trying a re-import for review. I suspect the users of this feature would be OK with knowing that it can't be 100% accurate in the face of encoding errors. It's more important that in the usual cases, things like bad delimiters and missing columns, you can easily manipulate the rejects as simple text. Making that harder just for this edge case wouldn't match the priorities of the users of this feature I've encountered.
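
As a rough illustration of the kind of systematic update meant here, the following is a minimal sketch in Python, not anything from the proposed patch. It assumes the rejects are plain text lines with a comma delimiter and \N as the NULL marker, and it splits naively on the delimiter without handling quoting or escaping. The function name pad_missing_columns and its parameters are hypothetical.

```python
def pad_missing_columns(lines, expected_cols, delimiter=",", null_marker="\\N"):
    """Append NULL markers to rejected rows that have too few columns.

    Assumption: fields never contain the delimiter themselves (no quoting
    or escaping is handled), so counting delimiters gives the column count.
    """
    fixed = []
    for line in lines:
        row = line.rstrip("\n")
        missing = expected_cols - (row.count(delimiter) + 1)
        if missing > 0:
            # One delimiter plus one NULL marker per missing trailing column.
            row += (delimiter + null_marker) * missing
        fixed.append(row)
    return fixed


if __name__ == "__main__":
    rejects = ["1,alice,30\n", "2,bob\n"]
    for row in pad_missing_columns(rejects, expected_cols=3):
        print(row)
```

Rows that already have enough columns pass through unchanged, so the output can be reviewed by eye before attempting the re-import.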


--
* Greg Smith gsm...@gregsmith.com http://www.gregsmith.com Baltimore, MD
