Re: DBD::mysql path forward

Patrick M. Galbraith Sun, 24 Sep 2017 13:56:44 -0700

Thank you for concern, I completely understand.

We have no intention of releasing anything that would do this (datacorruption) and testing will ensure this. The main objective here isthat DBD::mysql is on par with all the other drivers, the whole ideabehind a driver that DBI can use and code should work the sameregardless of underlying RDBMS. Having worked with other languages inthe last few years (PDO, Go/Gorm, Python, ODBC, etc) it's something Iwant for Perl and MySQL as well.


Regards,

Patrick

On 9/19/17 12:10 PM, Darren Duncan wrote:

What Night Light's post says to me is that there is high risk ofcausing data corruption if any changes are made under the DBD::mysqlname where DBD::mysql has not been exhaustively tested to guaranteethat its behavior is backwards compatible.
This makes a stronger case to me that the DBD::mysql Git master (thatwhich includes the 4.042 changes and any other default breakingchanges) should rename the Perl driver package name, I suggestDBD::mysql2 version 5.0, and that any changes not guaranteed backwardscompatible for whatever reason go there.
If the Git legacy maintenance branch 4.041/3 can have careful securitypatches applied that don't require any changes to user code to preventbreakage, it gets them, and otherwise only DBD::mysql2 gets any changes.
By doing what I said, we can be guaranteed that users with no controlover how DBD::mysql gets upgraded for them will introduce corruptionsimply for upgrading.
-- Darren Duncan

On 2017-09-19 5:46 AM, Night Light wrote:
Dear Perl gurus,
This is my first post. I'm using Perl with great joy, and I'd like toexpress my
gratitude for all you are doing to keep Perl stable and fun to use.
I'd like to ask to object to re-releasing this version and discuss onhow to
make 4.043 backwards compatible instead.
This change will with 100% certainty corrupt all BLOB data written tothedatabase when the developer did not read the release notes beforeapplying the
latest version of DBD::mysql (and changed its code consequently).
Knowing that sysadmins have the habit of not always reading therelease notes ofeach updated package the likelihood that this will happen willtherefore high.I myself wasn't even shown the release notes as it was a dependencyof an
updated package that I applied.
The exposure of this change is big as DBD::mysql affects multipleapplications
and many user bases.
I believe deliberately introducing industry wide database corruption is
something that will significantly harm peoples confidence in using Perl.
I believe that not providing backwards compatibility is not in linewith thePerl policy that has been carefully put together by the community tomaintain
the quality of Perl as it is today.
http://perldoc.perl.org/perlpolicy.html#BACKWARD-COMPATIBILITY-AND-DEPRECATION
I therefore believe the only solution is an upgrade that is bydefault backwardscompatible, and where it is the user who decides when to start UTF8encode the
input values of a SQL request instead.
If it is too time consuming or too difficult it should be consideredto park the
UTF8-encoding "fix" and release a version with the security fix first.

I have the following objections against this release:
1. the upgrade will corrupt more records than it fixes (it does moreharm than good)2. the reason given for not providing backward compatibility("because it washard to implement") is not plausible given the level of unwanted sideeffects. This especially knowing that there is already a mechanism in placeto signalif its wants UTF8 encoding or not(mysql_enable_utf8/mysql_enable_utf8mb4).3. it costs more resources to coordinate/discuss a "way forward" oroptions than
to implement a solution that addresses backwards compatibility
4. it is unreasonable to ask for changing existing source knowingthat depending
modules may not be actively maintained or proprietary
It can be argued that such module should always be maintained butit does not
change the fact that a good running Perl program becomes unusable
5. it does not inform the user that after upgrading existing codewill start
write corrupt BLOB records
6. it does not inform the user about the fact that a code review ofall existing
code is necessary, and how it needs to be changed and tested
7. it does not give the user the option to decide how the BLOB'sshould be
stored/encoded (opt in)
8. it does not provide backwards compatibility
By doing so it does not respect the Perl policy that has beencarefully puttogether by the community to maintain the quality of Perl as it istoday.
http://perldoc.perl.org/perlpolicy.html#BACKWARD-COMPATIBILITY-AND-DEPRECATION
9. it blocks users from using DBD::mysql upgrades as long as theyhave not
rewritten their existing code
10. not all users from DBD::mysql can be warned beforehand about thesideeffects as it is not known which private parties have code that useDBD::mysql
12. I believe development will go faster when support for backwards
compatibility is addressed
13. having to write 1 extra line for each SQL query value is a monksjob that
will make the module less attractive to use

About forking to DBD::mariadb?:
The primary reason to create such a module is when the communicationprotocol of
Mariadb has become incompatible with Mysql.
To use this namespace to fix a bug in DBD::mysql does not meet thatcriteria andcauses confusion for developers and unnecessary pollution of the DBDnamespace.
---
For people that do not know the impact of the change that is pendingto be
committed:
(see Github issue that includes 3 reports of companies that suffereddata loss
https://github.com/perl5-dbi/DBD-mysql/issues/117 )

Issue: some UTF8 characters are not properly displayed after retrieval
Cause: SQL query values are not UTF8 encoded when sent to thedatabase but they
are all decoded once retrieved.
Occurence: Only records with string data that can only be writtenwith UTF8. Itcan be considered rare as people haven't reported this issue after 10years of
usage.
Regional impact: Only affects countries which characters need UTF8encoding and
only affects string values.
Steps to recover from it: Read string data unencoded and write itencoded.
Changes of upgrade pending to be re-released:
SQL query values are both UTF8 encoded when sent to the database aswhen its
retrieved (including BLOB fields).
BLOB fields will be excluded from encoding only if you specify itsdata type.
Side effects from installing upgrade:
- BLOB data will be written after UTF8 encoding and will therefore becorrupt- no possibility to detect if a BLOB field is corrupt or not. Onlywhen known
when the INSERT/UPDATE took place, and when the upgrade was installed
- existing data will still display incorrect
Occurence: every INSERT/UPDATE statement will start writing corruptedBLOB data
Regional impact: worldwide
Steps to recover from it corrupted BLOBs? You cannot. Your binaryblobs areencoded as if they were UTF8 strings. Your binary data isunrecoverable (as in
"gone forever").
If you are a dentist you have to ask your customers to come back tomake another
x-ray as the made photo's are gone.

What is asked from the developer to prevent this from happening?
- do not miss reading the release notes before upgrading
- review all source code (including written by other includedmodules) and
specify the data type of each SQL parameter value
  before: $dbh->do('INSERT INTO test (BLOB1,BLOB2,BLOB3,BLOB4)
VALUES(?,?,?,?)',undef,$col1,$col2,$col3);
after: $dbh->do('INSERT INTO test (BLOB1,BLOB2,BLOB3,BLOB4)VALUES(?,?,?,?)');
          $sth->bind_param(1, $file, SQL_BLOB);
          $sth->bind_param(2, $file, SQL_BLOB);
          $sth->bind_param(3, $file, SQL_BLOB);
          ...
One line more for each SQL statement. This will be a time consumingmonks taskduring which the user will ask why this is necessary while it workedbefore.
- upgrade scripts need to be written to UTF8 encode existing string data
- retest all source code

Re: DBD::mysql path forward

Reply via email to