Hey all,

I am working with JSON that is on the whole fairly clean.  I am trying to
load into Parquet files, and the previous days worth of data worked just
fine, but todays data has something wrong with it and I Can't figure out
what it is. Unfortunately, I can't post the data, which I know makes this
hard to troubleshoot for the community. Hopefully I can provide some info
here, and get some pointers on where to look, and then report back on how
we could potentially improve the error messages.

The error is below.


I am looking to figure out given the information reported where I'd look to
trouble shoot this. Obviously the file 02ffc306e877_my_load_1446640931.json
is where I am looking to start

This file has 3000 lines (records of data, so it's somewhere in between.

The index/length/expected range don't mean anything to me I could use some
help there, because I am not even sure what I am looking for.

The record and/or Fragment... do those help me dig in?

Since this is one record per line, I went to line 2402 but that record
looks completely normal to me, (like all the other ones) but since this is
dense text, I am obviously missing something, but is the record the line
number?

Any other pointers I can use to trouble shoot this?

Thanks!

Error:


Caused by: org.apache.drill.common.exceptions.UserRemoteException:
DATA_READ ERROR: Error parsing JSON - index: 9604, length: 4 (expected:
range(0, 8192))



File
/etl/dev/my-metadata/mysqspull/loads/2015-11-04/02ffc306e877_my_load_1446640931.json

Record  2402

Fragment 1:5

Reply via email to