The error message "index: 9604, length: 4 (expected: range(0, 8192))" suggests that the error happened when Drill tried to access a memory buffer (most likely while writing an int or float value): it attempted a 4-byte access at offset 9604 in a buffer that holds only 8192 bytes. This may be a bug that is exposed by that particular data record.
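To make the three numbers concrete, here is a rough Python sketch of the kind of bounds check that produces a message of this shape. It is a simplified model and not Drill's actual code; the function name `check_index` and the exception type are my own assumptions for illustration.

```python
def check_index(index: int, length: int, capacity: int) -> None:
    """Raise if the access [index, index + length) does not fit in the buffer.

    Hypothetical model of a buffer bounds check; not taken from Drill's source.
    """
    if index < 0 or length < 0 or index + length > capacity:
        raise IndexError(
            f"index: {index}, length: {length} (expected: range(0, {capacity}))"
        )


if __name__ == "__main__":
    # The last valid 4-byte slot in an 8192-byte buffer starts at offset 8188.
    check_index(8188, 4, 8192)

    # The values from the reported error: offset 9604 is outside the buffer.
    try:
        check_index(9604, 4, 8192)
    except IndexError as err:
        print(err)
```

Under this model, offset 9604 is 1412 bytes past the end of an 8192-byte buffer, so it does not look like a small off-by-one. It looks more like a buffer that was not grown to match the data being written.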
You can try enabling verbose error logging before running the query again:

    set `exec.errors.verbose`=true;

This should give us a nice stack trace about this error.

Thanks

On Wed, Nov 4, 2015 at 7:29 AM, John Omernik wrote:

> There are multiple fields in that record, including two lists. Both lists
> have data in them (now I am running with JSON text mode because at times
> the first value is a JSON null, but in these cases, that should be turned
> to "null" as a string, if I am understanding things correctly) and
> shouldn't be causing a problem.
>
> On Wed, Nov 4, 2015 at 9:21 AM, Hsuan Yi Chu wrote:
>
> > What is the data type for that record in line 2402? A list?
> >
> > Do you think it could be similar to this issue?
> >
> > https://issues.apache.org/jira/browse/DRILL-4006
> >
> > On Wed, Nov 4, 2015 at 6:48 AM, John Omernik wrote:
> >
> > > Hey all,
> > >
> > > I am working with JSON that is on the whole fairly clean. I am trying
> > > to load into Parquet files, and the previous day's worth of data worked
> > > just fine, but today's data has something wrong with it and I can't
> > > figure out what it is. Unfortunately, I can't post the data, which I
> > > know makes this hard to troubleshoot for the community. Hopefully I can
> > > provide some info here, and get some pointers on where to look, and
> > > then report back on how we could potentially improve the error
> > > messages.
> > >
> > > The error is below.
> > >
> > > I am looking to figure out, given the information reported, where I'd
> > > look to troubleshoot this. Obviously the file
> > > 02ffc306e877_my_load_1446640931.json is where I am looking to start.
> > >
> > > This file has 3000 lines (records of data), so it's somewhere in
> > > between.
> > >
> > > The index/length/expected range don't mean anything to me. I could use
> > > some help there, because I am not even sure what I am looking for.
> > >
> > > The record and/or Fragment... do those help me dig in?
> > >
> > > Since this is one record per line, I went to line 2402, but that record
> > > looks completely normal to me (like all the other ones). Since this is
> > > dense text, I am obviously missing something, but is the record the
> > > line number?
> > >
> > > Any other pointers I can use to troubleshoot this?
> > >
> > > Thanks!
> > >
> > > Error:
> > >
> > > Caused by: org.apache.drill.common.exceptions.UserRemoteException:
> > > DATA_READ ERROR: Error parsing JSON - index: 9604, length: 4 (expected:
> > > range(0, 8192))
> > >
> > > File
> > > /etl/dev/my-metadata/mysqspull/loads/2015-11-04/02ffc306e877_my_load_1446640931.json
> > >
> > > Record 2402
> > >
> > > Fragment 1:5

--
Abdelhakim Deneche
Software Engineer
<http://www.mapr.com/>
