[jira] [Commented] (DRILL-3202) Count(*) fails on JSON wrapped up in single array - JSON parsing error
[ https://issues.apache.org/jira/browse/DRILL-3202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16495641#comment-16495641 ] Adrian-Bogdan Ionescu commented on DRILL-3202: -- You can use SELECT COUNT([ALL] ) FROM dfs.`` to get desired result. Tested in Drill 1.13.0 > Count(*) fails on JSON wrapped up in single array - JSON parsing error > -- > > Key: DRILL-3202 > URL: https://issues.apache.org/jira/browse/DRILL-3202 > Project: Apache Drill > Issue Type: Bug > Components: Storage - JSON >Affects Versions: 1.0.0 >Reporter: Neeraja >Assignee: Steven Phillips >Priority: Major > Fix For: Future > > Attachments: DRILL-3202.patch > > > I have a JSON document as follows. > [ > { > "Category": "1,2", > "Comments": "Total sites: 20, RV sites: 20, Elec sites: 20, Water at > site, RV Dump, Showers, Flush Toilets, RV Fee: $14, Tent Fee: $14, Elev: > 545', Tel: 256-577-9619, Nearest town: Muscle Shoals", > "Latitude": "34.800446", > "Longitude": "-87.498242", > "Name": "Alloys Co Park", > "State": "AL", > "Type": "cp", > "URL": > "http://www.campingroadtrip.com/campgrounds/campground/campground/23478/alabama/colbert-county-alloys-park-campground; > } > ] > Drill has ability to unwrap the array (without user specifying it) and > perform some SQL operations on it. However count(*) specifically fails on > these documents. > 0: jdbc:drill:zk=local> select * from > dfs.`default`.`/Users/nrentachintala/Downloads/yelp/uspointsofinterestshort.json` > limit 10; > +---++---++---++---+--+ > | Category | >Comments > | Latitude | Longitude | Name | > State | Type | URL | > +---++---++---++---+--+ > | 1,2 | Total sites: 20, RV sites: 20, Elec sites: 20, Water at site, RV > Dump, Showers, Flush Toilets, RV Fee: $14, Tent Fee: $14, Elev: 545', Tel: > 256-577-9619, Nearest town: Muscle Shoals | 34.800446 | -87.498242 | Alloys > Co Park | AL | cp | > http://www.campingroadtrip.com/campgrounds/campground/campground/23478/alabama/colbert-county-alloys-park-campground > | > +---++---++---++---+--+ > 1 row selected (0.197 seconds) > 0: jdbc:drill:zk=local> select distinct type from > dfs.`default`.`/Users/nrentachintala/Downloads/yelp/uspointsofinterestshort.json` > limit 10; > +---+ > | type | > +---+ > | cp| > +---+ > 1 row selected (0.193 seconds) > 0: jdbc:drill:zk=local> > 0: jdbc:drill:zk=local> select count(*) from > dfs.`default`.`/Users/nrentachintala/Downloads/yelp/uspointsofinterestshort.json` > limit 10; > Error: DATA_READ ERROR: Error parsing JSON - Cannot read from the middle of a > record. Current token was START_ARRAY > File /Users/nrentachintala/Downloads/yelp/uspointsofinterestshort.json > Record 1 > Fragment 0:0 > [Error Id: 4742f738-1d43-4fef-af48-110065c9dd83 on 172.16.1.82:31010] > (state=,code=0) -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (DRILL-3202) Count(*) fails on JSON wrapped up in single array - JSON parsing error
[ https://issues.apache.org/jira/browse/DRILL-3202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16448752#comment-16448752 ] Scott Wilburn commented on DRILL-3202: -- Is there any workaround for this issue? It's hard to believe that I can't count objects in a json file using Drill. > Count(*) fails on JSON wrapped up in single array - JSON parsing error > -- > > Key: DRILL-3202 > URL: https://issues.apache.org/jira/browse/DRILL-3202 > Project: Apache Drill > Issue Type: Bug > Components: Storage - JSON >Affects Versions: 1.0.0 >Reporter: Neeraja >Assignee: Steven Phillips >Priority: Major > Fix For: Future > > Attachments: DRILL-3202.patch > > > I have a JSON document as follows. > [ > { > "Category": "1,2", > "Comments": "Total sites: 20, RV sites: 20, Elec sites: 20, Water at > site, RV Dump, Showers, Flush Toilets, RV Fee: $14, Tent Fee: $14, Elev: > 545', Tel: 256-577-9619, Nearest town: Muscle Shoals", > "Latitude": "34.800446", > "Longitude": "-87.498242", > "Name": "Alloys Co Park", > "State": "AL", > "Type": "cp", > "URL": > "http://www.campingroadtrip.com/campgrounds/campground/campground/23478/alabama/colbert-county-alloys-park-campground; > } > ] > Drill has ability to unwrap the array (without user specifying it) and > perform some SQL operations on it. However count(*) specifically fails on > these documents. > 0: jdbc:drill:zk=local> select * from > dfs.`default`.`/Users/nrentachintala/Downloads/yelp/uspointsofinterestshort.json` > limit 10; > +---++---++---++---+--+ > | Category | >Comments > | Latitude | Longitude | Name | > State | Type | URL | > +---++---++---++---+--+ > | 1,2 | Total sites: 20, RV sites: 20, Elec sites: 20, Water at site, RV > Dump, Showers, Flush Toilets, RV Fee: $14, Tent Fee: $14, Elev: 545', Tel: > 256-577-9619, Nearest town: Muscle Shoals | 34.800446 | -87.498242 | Alloys > Co Park | AL | cp | > http://www.campingroadtrip.com/campgrounds/campground/campground/23478/alabama/colbert-county-alloys-park-campground > | > +---++---++---++---+--+ > 1 row selected (0.197 seconds) > 0: jdbc:drill:zk=local> select distinct type from > dfs.`default`.`/Users/nrentachintala/Downloads/yelp/uspointsofinterestshort.json` > limit 10; > +---+ > | type | > +---+ > | cp| > +---+ > 1 row selected (0.193 seconds) > 0: jdbc:drill:zk=local> > 0: jdbc:drill:zk=local> select count(*) from > dfs.`default`.`/Users/nrentachintala/Downloads/yelp/uspointsofinterestshort.json` > limit 10; > Error: DATA_READ ERROR: Error parsing JSON - Cannot read from the middle of a > record. Current token was START_ARRAY > File /Users/nrentachintala/Downloads/yelp/uspointsofinterestshort.json > Record 1 > Fragment 0:0 > [Error Id: 4742f738-1d43-4fef-af48-110065c9dd83 on 172.16.1.82:31010] > (state=,code=0) -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (DRILL-3202) Count(*) fails on JSON wrapped up in single array - JSON parsing error
[ https://issues.apache.org/jira/browse/DRILL-3202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14615989#comment-14615989 ] Steven Phillips commented on DRILL-3202: Created reviewboard https://reviews.apache.org/r/36233/ Count(*) fails on JSON wrapped up in single array - JSON parsing error -- Key: DRILL-3202 URL: https://issues.apache.org/jira/browse/DRILL-3202 Project: Apache Drill Issue Type: Bug Components: Storage - JSON Affects Versions: 1.0.0 Reporter: Neeraja Assignee: Steven Phillips Fix For: 1.2.0 Attachments: DRILL-3202.patch I have a JSON document as follows. [ { Category: 1,2, Comments: Total sites: 20, RV sites: 20, Elec sites: 20, Water at site, RV Dump, Showers, Flush Toilets, RV Fee: $14, Tent Fee: $14, Elev: 545', Tel: 256-577-9619, Nearest town: Muscle Shoals, Latitude: 34.800446, Longitude: -87.498242, Name: Alloys Co Park, State: AL, Type: cp, URL: http://www.campingroadtrip.com/campgrounds/campground/campground/23478/alabama/colbert-county-alloys-park-campground; } ] Drill has ability to unwrap the array (without user specifying it) and perform some SQL operations on it. However count(*) specifically fails on these documents. 0: jdbc:drill:zk=local select * from dfs.`default`.`/Users/nrentachintala/Downloads/yelp/uspointsofinterestshort.json` limit 10; +---++---++---++---+--+ | Category | Comments | Latitude | Longitude | Name | State | Type | URL | +---++---++---++---+--+ | 1,2 | Total sites: 20, RV sites: 20, Elec sites: 20, Water at site, RV Dump, Showers, Flush Toilets, RV Fee: $14, Tent Fee: $14, Elev: 545', Tel: 256-577-9619, Nearest town: Muscle Shoals | 34.800446 | -87.498242 | Alloys Co Park | AL | cp | http://www.campingroadtrip.com/campgrounds/campground/campground/23478/alabama/colbert-county-alloys-park-campground | +---++---++---++---+--+ 1 row selected (0.197 seconds) 0: jdbc:drill:zk=local select distinct type from dfs.`default`.`/Users/nrentachintala/Downloads/yelp/uspointsofinterestshort.json` limit 10; +---+ | type | +---+ | cp| +---+ 1 row selected (0.193 seconds) 0: jdbc:drill:zk=local 0: jdbc:drill:zk=local select count(*) from dfs.`default`.`/Users/nrentachintala/Downloads/yelp/uspointsofinterestshort.json` limit 10; Error: DATA_READ ERROR: Error parsing JSON - Cannot read from the middle of a record. Current token was START_ARRAY File /Users/nrentachintala/Downloads/yelp/uspointsofinterestshort.json Record 1 Fragment 0:0 [Error Id: 4742f738-1d43-4fef-af48-110065c9dd83 on 172.16.1.82:31010] (state=,code=0) -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (DRILL-3202) Count(*) fails on JSON wrapped up in single array - JSON parsing error
[ https://issues.apache.org/jira/browse/DRILL-3202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14563253#comment-14563253 ] Chun Chang commented on DRILL-3202: --- possible duplicate of DRILL-2385 Count(*) fails on JSON wrapped up in single array - JSON parsing error -- Key: DRILL-3202 URL: https://issues.apache.org/jira/browse/DRILL-3202 Project: Apache Drill Issue Type: Bug Components: Storage - JSON Affects Versions: 1.0.0 Reporter: Neeraja Assignee: Steven Phillips I have a JSON document as follows. [ { Category: 1,2, Comments: Total sites: 20, RV sites: 20, Elec sites: 20, Water at site, RV Dump, Showers, Flush Toilets, RV Fee: $14, Tent Fee: $14, Elev: 545', Tel: 256-577-9619, Nearest town: Muscle Shoals, Latitude: 34.800446, Longitude: -87.498242, Name: Alloys Co Park, State: AL, Type: cp, URL: http://www.campingroadtrip.com/campgrounds/campground/campground/23478/alabama/colbert-county-alloys-park-campground; } ] Drill has ability to unwrap the array (without user specifying it) and perform some SQL operations on it. However count(*) specifically fails on these documents. 0: jdbc:drill:zk=local select * from dfs.`default`.`/Users/nrentachintala/Downloads/yelp/uspointsofinterestshort.json` limit 10; +---++---++---++---+--+ | Category | Comments | Latitude | Longitude | Name | State | Type | URL | +---++---++---++---+--+ | 1,2 | Total sites: 20, RV sites: 20, Elec sites: 20, Water at site, RV Dump, Showers, Flush Toilets, RV Fee: $14, Tent Fee: $14, Elev: 545', Tel: 256-577-9619, Nearest town: Muscle Shoals | 34.800446 | -87.498242 | Alloys Co Park | AL | cp | http://www.campingroadtrip.com/campgrounds/campground/campground/23478/alabama/colbert-county-alloys-park-campground | +---++---++---++---+--+ 1 row selected (0.197 seconds) 0: jdbc:drill:zk=local select distinct type from dfs.`default`.`/Users/nrentachintala/Downloads/yelp/uspointsofinterestshort.json` limit 10; +---+ | type | +---+ | cp| +---+ 1 row selected (0.193 seconds) 0: jdbc:drill:zk=local 0: jdbc:drill:zk=local select count(*) from dfs.`default`.`/Users/nrentachintala/Downloads/yelp/uspointsofinterestshort.json` limit 10; Error: DATA_READ ERROR: Error parsing JSON - Cannot read from the middle of a record. Current token was START_ARRAY File /Users/nrentachintala/Downloads/yelp/uspointsofinterestshort.json Record 1 Fragment 0:0 [Error Id: 4742f738-1d43-4fef-af48-110065c9dd83 on 172.16.1.82:31010] (state=,code=0) -- This message was sent by Atlassian JIRA (v6.3.4#6332)