GitHub user jaltekruse opened a pull request:
https://github.com/apache/drill/pull/266
DRILL-4056: Avro corruption bug with UTF-8 strings
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/jaltekruse/incubator-drill
4056-avro-corruption-bug
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/drill/pull/266.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #266
----
commit a3e0cbe3820a0350d58c59f374877a12184850e0
Author: Jason Altekruse <[email protected]>
Date: 2015-11-13T23:46:58Z
DRILL-4056: Fix corruption bug reading string data out of Avro
commit 44460fd5a72d6a61b232c335bb8beaaff9daad87
Author: Jason Altekruse <[email protected]>
Date: 2015-11-14T00:26:33Z
DRILL-4056: Part 2 - Cleanup in Avro reader.
Removed use of unnecessary Holder objects. Added restriction on batch
size produced by a single call to next. Did not get a chance to confirm
but it looks like it was reading an entire file into a single batch,
which could have serious performance impacts on very large files.
commit dc084c1255a59aead865e641f952e9e162d4c5e5
Author: Jason Altekruse <[email protected]>
Date: 2015-11-17T23:42:44Z
DRILL-4056: Part 3 - Adding results verification to avro tests.
Task to be finished as part of DRILL-4110.
----
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---