Khurram Faraaz created DRILL-2985:
-------------------------------------

             Summary: NPE seen for project distinct values from CSV
                 Key: DRILL-2985
                 URL: https://issues.apache.org/jira/browse/DRILL-2985
             Project: Apache Drill
          Issue Type: Bug
          Components: Storage - Text & CSV
    Affects Versions: 1.0.0
         Environment: b856d0dd50a4495f95ee5d32ea3d396df56842ed | DRILL-2826: 
Simplify and centralize Operator Cleanup | 05.05.2015 @ 20:28:37 EDT
            Reporter: Khurram Faraaz
            Assignee: Steven Phillips


I am seeing a NPE when we project distinct values. Test was run on 4 node 
cluster on CentOS.

{code}
0: jdbc:drill:> select distinct type from `airports.csv`;
Error: SYSTEM ERROR: null

Fragment 0:0

[Error Id: 9f6e6929-41f6-4821-8a31-8bd45143f3d1 on centos-01.qa.lab:31010] 
(state=,code=0)
{code}

Stacktrace from drillbit.log 

{code}
2015-05-07 20:03:21,790 [2ab43af5-b6a9-0578-c45b-454b1a1a7b35:frag:0:0] ERROR 
o.a.d.c.e.DrillRuntimeException - SYSTEM ERROR: null

Fragment 0:0

[Error Id: 9f6e6929-41f6-4821-8a31-8bd45143f3d1 on centos-01.qa.lab:31010]
org.apache.drill.common.exceptions.UserException: SYSTEM ERROR: null

Fragment 0:0

[Error Id: 9f6e6929-41f6-4821-8a31-8bd45143f3d1 on centos-01.qa.lab:31010]
        at 
org.apache.drill.common.exceptions.UserException$Builder.build(UserException.java:465)
 ~[drill-common-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at 
org.apache.drill.exec.work.fragment.FragmentExecutor.sendFinalState(FragmentExecutor.java:262)
 [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at 
org.apache.drill.exec.work.fragment.FragmentExecutor.run(FragmentExecutor.java:232)
 [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at 
org.apache.drill.common.SelfCleaningRunnable.run(SelfCleaningRunnable.java:38) 
[drill-common-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) 
[na:1.7.0_75]
        at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) 
[na:1.7.0_75]
        at java.lang.Thread.run(Thread.java:745) [na:1.7.0_75]
Caused by: java.lang.NullPointerException: null
        at 
org.apache.drill.exec.store.easy.text.compliant.CompliantTextRecordReader.cleanup(CompliantTextRecordReader.java:147)
 ~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at 
org.apache.drill.exec.physical.impl.ScanBatch.<init>(ScanBatch.java:104) 
~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at 
org.apache.drill.exec.store.dfs.easy.EasyFormatPlugin.getReaderBatch(EasyFormatPlugin.java:189)
 ~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at 
org.apache.drill.exec.store.dfs.easy.EasyReaderBatchCreator.getBatch(EasyReaderBatchCreator.java:35)
 ~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at 
org.apache.drill.exec.store.dfs.easy.EasyReaderBatchCreator.getBatch(EasyReaderBatchCreator.java:28)
 ~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at 
org.apache.drill.exec.physical.impl.ImplCreator.getRecordBatch(ImplCreator.java:140)
 ~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at 
org.apache.drill.exec.physical.impl.ImplCreator.getChildren(ImplCreator.java:163)
 ~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at 
org.apache.drill.exec.physical.impl.ImplCreator.getRecordBatch(ImplCreator.java:121)
 ~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at 
org.apache.drill.exec.physical.impl.ImplCreator.getChildren(ImplCreator.java:163)
 ~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at 
org.apache.drill.exec.physical.impl.ImplCreator.getRecordBatch(ImplCreator.java:121)
 ~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at 
org.apache.drill.exec.physical.impl.ImplCreator.getChildren(ImplCreator.java:163)
 ~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at 
org.apache.drill.exec.physical.impl.ImplCreator.getRecordBatch(ImplCreator.java:121)
 ~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at 
org.apache.drill.exec.physical.impl.ImplCreator.getChildren(ImplCreator.java:163)
 ~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at 
org.apache.drill.exec.physical.impl.ImplCreator.getRootExec(ImplCreator.java:96)
 ~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at 
org.apache.drill.exec.physical.impl.ImplCreator.getExec(ImplCreator.java:77) 
~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at 
org.apache.drill.exec.work.fragment.FragmentExecutor.run(FragmentExecutor.java:175)
 [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        ... 4 common frames omitted
2015-05-07 20:03:21,792 [2ab43af5-b6a9-0578-c45b-454b1a1a7b35:frag:0:0] INFO  
o.a.drill.exec.work.foreman.Foreman - State change requested.  RUNNING --> 
FAILED
org.apache.drill.common.exceptions.UserRemoteException: SYSTEM ERROR: null

Fragment 0:0

[Error Id: 9f6e6929-41f6-4821-8a31-8bd45143f3d1 on centos-01.qa.lab:31010]
        at 
org.apache.drill.exec.work.foreman.QueryManager$1.statusUpdate(QueryManager.java:409)
 [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at 
org.apache.drill.exec.work.foreman.QueryManager$RootStatusReporter.statusChange(QueryManager.java:389)
 [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at 
org.apache.drill.exec.work.fragment.AbstractStatusReporter.fail(AbstractStatusReporter.java:90)
 [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at 
org.apache.drill.exec.work.fragment.AbstractStatusReporter.fail(AbstractStatusReporter.java:86)
 [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at 
org.apache.drill.exec.work.fragment.FragmentExecutor.sendFinalState(FragmentExecutor.java:266)
 [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at 
org.apache.drill.exec.work.fragment.FragmentExecutor.run(FragmentExecutor.java:232)
 [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at 
org.apache.drill.common.SelfCleaningRunnable.run(SelfCleaningRunnable.java:38) 
[drill-common-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
        at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) 
[na:1.7.0_75]
        at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) 
[na:1.7.0_75]
        at java.lang.Thread.run(Thread.java:745) [na:1.7.0_75]
{code}

Snippet of data from CSV file

{code}
[root@centos-01 logs]# hadoop fs -cat /tmp/airports.csv
"id","ident","type","name","latitude_deg","longitude_deg","elevation_ft","continent","iso_country","iso_region","municipality","scheduled_service","gps_code","iata_code","local_code","home_link","wikipedia_link","keywords"
6523,"00A","heliport","Total Rf 
Heliport",40.07080078125,-74.9336013793945,11,"NA","US","US-PA","Bensalem","no","00A",,"00A",,,
6524,"00AK","small_airport","Lowell 
Field",59.94919968,-151.695999146,450,"NA","US","US-AK","Anchor 
Point","no","00AK",,"00AK",,,
6525,"00AL","small_airport","Epps 
Airpark",34.8647994995117,-86.7703018188477,820,"NA","US","US-AL","Harvest","no","00AL",,"00AL",,,
6526,"00AR","heliport","Newport Hospital & Clinic 
Heliport",35.608699798584,-91.2548980712891,237,"NA","US","US-AR","Newport","no","00AR",,"00AR",,,
6527,"00AZ","small_airport","Cordes 
Airport",34.3055992126465,-112.165000915527,3810,"NA","US","US-AZ","Cordes","no","00AZ",,"00AZ",,,
6528,"00CA","small_airport","Goldstone /Gts/ 
Airport",35.3504981995,-116.888000488,3038,"NA","US","US-CA","Barstow","no","00CA",,"00CA",,,
6529,"00CO","small_airport","Cass 
Field",40.622200012207,-104.34400177002,4830,"NA","US","US-CO","Briggsdale","no","00CO",,"00CO",,,

{code}

details of different columns in the file

{code}
0: jdbc:drill:> select * from `airports.csv` limit 10;
+------------+
|  columns   |
+------------+
| 
["id","ident","type","name","latitude_deg","longitude_deg","elevation_ft","continent","iso_country","iso_region","municipality","scheduled_service","gps_code","iata_code","local_code","home_link","wikipedia_link","keywords"]
 |
| ["6523","00A","heliport","Total Rf 
Heliport","40.07080078125","-74.9336013793945","11","NA","US","US-PA","Bensalem","no","00A","","00A","","",""]
 |
| ["6524","00AK","small_airport","Lowell 
Field","59.94919968","-151.695999146","450","NA","US","US-AK","Anchor 
Point","no","00AK","","00AK","","",""] |
| ["6525","00AL","small_airport","Epps 
Airpark","34.8647994995117","-86.7703018188477","820","NA","US","US-AL","Harvest","no","00AL","","00AL","","",""]
 |
| ["6526","00AR","heliport","Newport Hospital & Clinic 
Heliport","35.608699798584","-91.2548980712891","237","NA","US","US-AR","Newport","no","00AR","","00AR","","",""]
 |
| ["6527","00AZ","small_airport","Cordes 
Airport","34.3055992126465","-112.165000915527","3810","NA","US","US-AZ","Cordes","no","00AZ","","00AZ","","",""]
 |
| ["6528","00CA","small_airport","Goldstone /Gts/ 
Airport","35.3504981995","-116.888000488","3038","NA","US","US-CA","Barstow","no","00CA","","00CA","","",""]
 |
| ["6529","00CO","small_airport","Cass 
Field","40.622200012207","-104.34400177002","4830","NA","US","US-CO","Briggsdale","no","00CO","","00CO","","",""]
 |
| ["6531","00FA","small_airport","Grass Patch 
Airport","28.6455001831055","-82.2190017700195","53","NA","US","US-FL","Bushnell","no","00FA","","00FA","","",""]
 |
| ["6532","00FD","heliport","Ringhaver 
Heliport","28.8465995788574","-82.3453979492188","25","NA","US","US-FL","Riverview","no","00FD","","00FD","","",""]
 |
+------------+
10 rows selected (0.264 seconds)
{code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to