Hi,
I am trying to access HDFS data sources running in a remote instance from
apache drill running in my laptop. But I get the following error :
2015-08-13 06:06:48,287 [Client-1] INFO
o.a.d.j.i.DrillResultSetImpl$ResultsListener - [#2] Query failed:
org.apache.drill.common.exceptions.UserRemoteException: SYSTEM ERROR:
BlockMissingException: Could not obtain block:
BP-67039680-172.17.0.1-1434643788667:blk_1073820202_79472
file=/tmp/test.json
I am able to view the files via the datasource, but
cannot issue a select * query. Can you please help? PFA the data source
configuration.
Thanks,
Malathi
{
"type": "file",
"enabled": true,
"connection": "hdfs://172.16.0.208:8020",
"workspaces": {
"root": {
"location": "/",
"writable": false,
"defaultInputFormat": null
},
"tmp": {
"location": "/tmp",
"writable": false,
"defaultInputFormat": null
}
},
"formats": {
"psv": {
"type": "text",
"extensions": [
"tbl"
],
"delimiter": "|"
},
"csv": {
"type": "text",
"extensions": [
"csv"
],
"delimiter": ","
},
"tsv": {
"type": "text",
"extensions": [
"tsv"
],
"delimiter": "\t"
},
"parquet": {
"type": "parquet"
},
"json": {
"type": "json"
},
"avro": {
"type": "avro"
}
}
}