This is a known problem. I would recommend the alternate implementation called
fastavro which is indeed faster.
Sam
On Thursday, March 24, 2016 4:05 PM, "Hong, Shunhui Welly"
<[email protected]> wrote:
<!--#yiv7600624740 _filtered #yiv7600624740 {font-family:Calibri;panose-1:2
15 5 2 2 2 4 3 2 4;} _filtered #yiv7600624740 {font-family:Consolas;panose-1:2
11 6 9 2 2 4 3 2 4;}#yiv7600624740 #yiv7600624740 p.yiv7600624740MsoNormal,
#yiv7600624740 li.yiv7600624740MsoNormal, #yiv7600624740
div.yiv7600624740MsoNormal
{margin:0in;margin-bottom:.0001pt;font-size:11.0pt;font-family:"Calibri",
"sans-serif";}#yiv7600624740 a:link, #yiv7600624740
span.yiv7600624740MsoHyperlink
{color:blue;text-decoration:underline;}#yiv7600624740 a:visited, #yiv7600624740
span.yiv7600624740MsoHyperlinkFollowed
{color:purple;text-decoration:underline;}#yiv7600624740 code
{font-family:Consolas;background:#F0F0F0;}#yiv7600624740
span.yiv7600624740EmailStyle17 {font-family:"Calibri",
"sans-serif";color:windowtext;}#yiv7600624740 .yiv7600624740MsoChpDefault
{font-family:"Calibri", "sans-serif";} _filtered #yiv7600624740 {margin:1.0in
1.0in 1.0in 1.0in;}#yiv7600624740 div.yiv7600624740WordSection1 {}-->Hi guys,
I am trying to decode avro binary data (55M took 5 mins) in Python. It seems
10+ times slower than in Java. Any suggestions? bytes_reader =
io.BytesIO(raw_bytes) decoder = avro.io.BinaryDecoder(bytes_reader) reader =
avro.io.DatumReader(schema) decoded_data = reader.read(decoder) Thank you,
Welly This email is confidential and subject to important disclaimers and
conditions including on offers for the purchase or sale of securities, accuracy
and completeness of information, viruses, confidentiality, legal privilege, and
legal entity disclaimers, available at
http://www.jpmorgan.com/pages/disclosures/email